(54) Title: TRANSGENIC ANIMALS FOR ANALYSING CYP3A4 CYTOCHROME P450 GENE REGULATION

(57) Abstract: The invention relates to the generation of non-human transgenic animals comprising a reporter construct for
producing a detectable amount of a reporter molecule operably linked to a transcriptional regulatory nucleic acid molecule from the human
CYP3A4 gene located between the initiation of transcription site of the gene and a position located 13,000 nucleotides upstream
from the site. The invention also relates to the use of these animals for determining the effect of a compound, particularly, but not
exclusively, a xenobiotic or steroid, on the regulation of expression of the CYP3A4 gene in a human.

TECHNICAL FIELD OF THE INVENTION

The invention relates to the generation of a transgenic 
animal and to the use of the animal for determining the 
effect of a compound, particularly, but not exclusively, a 
xenobiotic or steroid, on the regulation of expression of 
a P450 gene in a human. 

BACKGROUND OF THE INVENTION

Many endogenous and exogenous compounds are observed to
have a therapeutic effect in drug development trials in
vitro. However, the intended therapeutic effect is often
not realised in clinical practice, for example, when
compounds are co-administered, because certain compounds
induce the expression of the CYP3A4 gene. This induction
generates CYP3A4 cytochrome P450 molecules which
metabolise compounds before the intended therapeutic
effect of each compound can be realised. Accordingly,
induction of expression of the CYP3A4 gene interferes with
intended dosage, leading to therapeutic failure or
sub-optimal treatment.

Induction of CYP3A4 gene expression is a significant
problem for drug development because time, resources and
expense are wasted in the development of candidate drugs
for therapy of particular disease conditions which will
ultimately fail or perform sub-optimally in clinical
practice.



It would be advantageous to have an animal model for use
in drug development trials from which, at an early stage
of drug development, one could determine whether a
candidate drug would be likely to achieve an intended
therapeutic effect in a human.

Such an animal model would not be useful unless at least
some of the aspects of the regulation of CYP3A4 gene
expression in the human, especially tissue specific
expression, are reproduced. This is because in the human,
the CYP3A4 gene is expressed in specific tissues,
including liver and small intestine, which many compounds
inevitably come into contact with when administered for
the purpose of therapy. Accordingly, one would be unable
to determine whether the bio-availability of a candidate
drug would be sufficient for achieving an intended
therapeutic effect in clinical practice in a model which
does not reproduce the constitutive and xenobiotic induced
tissue specific expression of the CYP3A4 gene that is
observed in the human.

WO99/61622 and Goodwin et al. 1999 disclose a nucleic acid
molecule located 8 kb upstream from the initiation of
transcription site of the CYP3A4 gene which regulates
transcription of the CYP3A4 gene in response to xenobiotic
compounds. These documents do not disclose elements for
regulating the constitutive and xenobiotic inducible
tissue specific and developmental expression of the CYP3A4
gene observed in a human.

There is a need for an animal model which reproduces at
least some aspects of the expression of the CYP3A4 gene in
a human, for determining whether a compound, for example,
one identified in a drug development trial, would be
likely to induce CYP3A4, and hence cause drug-drug
interactions, or auto-induction of the metabolism of the
drug under study.

DESCRIPTION OF THE INVENTION

The invention seeks to address the above identified need
and in a first aspect provides a non-human mammal
comprising:

(a) a regulatory nucleic acid molecule which is capable
of regulating transcription of the human CYP3A4 gene and
which comprises a nucleotide sequence that is identical to
a sequence of the human CYP3A4 gene located between the
initiation of transcription site of the gene and a
position located at least 13,000 nucleotides upstream from
the site; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

As described herein, the inventors have found that the
incorporation of a region of the human CYP3A4 gene that is
located between the initiation of transcription site of
the gene and a position 13,000 nucleotides upstream of the
initiation of transcription site into an animal model
provides the animal with sufficient genetic information
for reproducing the constitutive and xenobiotic induced
tissue specific expression of the CYP3A4 gene that is
observed in humans. More specifically, the inventors have
generated animal models which contain a transgene
comprising this region and have observed that these models
provide constitutive and xenobiotic inducible expression
of a transgene in a tissue pattern which reproduces the
tissue specific expression of CYP3A4 which is observed in
a human. Importantly, the level of constitutive
expression is sufficient to allow one to observe the
effect on the regulation of tissue specific transgene
expression, of administration of a compound, for example,
a xenobiotic or steroid, to the animal.

Further, the inventors have observed that the animal
models described herein also reproduce aspects of the
constitutive and xenobiotic inducible developmental
expression of the CYP3A4 gene that is observed in humans.

These findings are unanticipated because prior to the
invention, there was no suggestion that the genetic
information required for simulating the constitutive and
xenobiotic induced tissue specific or developmental
expression of the CYP3A4 gene that is observed in a human
would be contained in the region of the human CYP3A4 gene
between the initiation of transcription site of the gene
and a position 13,000 nucleotides upstream of the
initiation of transcription site.

Further, prior to the invention, differences in the
induction profile of the mouse Cyp3a11 and the human
CYP3A4 gene had been observed, and differences had also
been observed in the ligand binding profile of mouse
transcription factors, especially PXR and CAR, and human
PXR and CAR. Accordingly, there was no suggestion that a
non-human animal would have factors sufficient for
interacting with a region of the CYP3A4 gene for
reproducing the constitutive and xenobiotic induced tissue
specific or developmental expression of CYP3A4 observed in
a human.

Further, prior to the invention, mechanisms associated
with transgene integration had been observed, such as gene
silencing and mosaic transgene expression, which limited
the extent to which a transcriptional enhancer element
incorporated into a transgenic model could reproduce
regulation of gene expression observed in a human.
Accordingly, there was no suggestion that a region of the
human CYP3A4 gene would be capable of reproducing the
regulation of expression of the CYP3A4 gene that is
observed in a human. However, as described herein, the
inventors have shown in 2 separate founder lines that the
expression of the transgene reproduces aspects of CYP3A4
gene expression that are observed in humans.

Thus in a second aspect, the invention provides a
non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the nucleotide
sequence of the human CYP3A4 gene that extends about
13,000 nucleotides upstream from the initiation of
transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

In one embodiment, the regulatory nucleic acid molecule
comprises the sequence shown in SEQ ID NO:1.

Further, as described herein, the inventors have generated
transgenic animals which contain a region of the human
CYP3A4 gene between the initiation of transcription site
and a position about 3,200 nucleotides upstream of the
initiation of transcription site and observed that the
transgene is not constitutively expressed or inducible by
xenobiotics in these animals. Accordingly, the inventors
have found that the genetic information required for
reproducing the constitutive and xenobiotic induced tissue
specific and developmental expression of CYP3A4 observed
in a human is contained in the region of the human CYP3A4
gene between the position located about 3,200 nucleotides
upstream of the initiation of transcription site of the
gene and a position 13,000 nucleotides upstream of the
initiation of transcription site.

Thus, in a third aspect, the invention provides a
non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the sequence of
the human CYP3A4 gene that extends about 8,000 nucleotides
upstream from a position about 3,000 nucleotides upstream
from the initiation of transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

In one embodiment, the regulatory nucleic acid molecule
comprises the sequence shown in SEQ ID NO:2.

In a fourth aspect, the invention provides a non-human
mammal comprising:

(a) a regulatory nucleic acid molecule which is capable
of regulating transcription of the human CYP3A4 gene and
which comprises a nucleotide sequence that is identical to
the sequence of the human CYP3A4 gene that extends about
600 nucleotides upstream from a position about 7,200
nucleotides upstream of the initiation of transcription
site of the gene; and

(b) a reporter nucleic acid molecule for producing a
detectable amount of a reporter molecule for indicating
regulation of transcription of the reporter nucleic acid
molecule by the regulatory nucleic acid molecule

wherein the reporter and regulatory nucleic acid molecules
are arranged to permit the regulatory nucleic acid
molecule to regulate transcription of the reporter nucleic
acid molecule.

In one embodiment, the regulatory nucleic acid molecule 
comprises the sequence shown in SEQ ID NO: 3. 

In another embodiment, the regulatory nucleic acid
molecule has the sequence of any one of the following
fragments of the CYP3A4 gene:

(i) a fragment consisting of from nucleotide positions
-13,000 to +53;

(ii) a fragment consisting of from nucleotide positions
-13,000 to -12,700 contiguous with -8,000 to +53;

(iii) a fragment consisting of from nucleotide positions
-13,000 to -5,100 contiguous with -1,200 to +53;

(v) a fragment consisting of from nucleotide positions
-7,800 to -6,000 contiguous with -362 to +53;

(vi) a fragment consisting of from nucleotide positions
-7,500 to -6,000 contiguous with -362 to +53.

A regulatory nucleic acid molecule which has the sequence
of a fragment consisting of from nucleotide positions
-7836 to -7207 contiguous with -362 to +53 is particularly
preferred, as this construct contains the minimal
sequences necessary for regulating transcription of the
human CYP3A4 gene, more specifically, an element
responsive to xenobiotics (the "Xenobiotic Response
Element Module" or "XREM") and the proximal promoter of
the CYP3A4 gene.

The regulatory nucleic acid molecule of the invention
typically contains at least one enhancer capable of
regulating transcription of a human CYP3A4 gene when
contacted with a nuclear receptor. Examples of such
enhancers are those capable of regulating transcription of
a human CYP3A4 gene when contacted with a nuclear receptor
bound to a ligand, such as a xenobiotic or steroid. Other
examples are those capable of regulating transcription of
a human CYP3A4 gene when contacted with a nuclear receptor
consisting of a heterodimer of PXR (pregnane X receptor,
otherwise known as SXR (steroid and xenobiotic receptor))
and RXR (9-cis retinoic acid receptor), or CAR
(constitutive androstane receptor-β) and RXR.

The inventors believe that certain nucleic acid molecules
which have substantially the same nucleotide sequence as a
regulatory nucleic acid molecule of the invention would
also have sufficient genetic information for reproducing
the constitutive and xenobiotic induced tissue specific
and developmental expression of the CYP3A4 gene that is
observed in a human. Accordingly, it will be understood
that nucleotides could be modified or deleted in regions
of the regulatory nucleic acid molecule, more
specifically, those regions which do not contain an
enhancer such as those described above, without
significantly limiting the capacity of the molecule to
regulate transcription of the human CYP3A4 gene.

The inventors recognise that it would be advantageous to
provide an animal model further capable of reproducing the
expression of other human genes, specifically those genes
encoding products which modify or modulate the therapeutic
activity of exogenous and endogenous compounds used for
therapy and cause drug-drug interactions, for example,
cytochrome P450 genes or ABC transporter superfamily
genes, for example, P-glycoprotein (otherwise known as
MDR-1). The regions controlling the constitutive and
xenobiotic induced tissue specific expression of some of
these genes are known, and in some instances, non-human
animal models have been generated. The inventors recognise
that the genetic background of these animals could be
incorporated into the non-human mammal of the present
invention, for example, by conventional breeding
techniques.

Thus in a fifth aspect, the invention provides a non-human
mammal of any one of the first to fourth aspects of the
invention, further comprising:

(c) a further regulatory nucleic acid molecule which is
capable of regulating transcription of a human gene; and

(d) a further reporter nucleic acid molecule for
producing a detectable amount of a further reporter
molecule for indicating regulation of transcription of the
further reporter nucleic acid molecule by the further
regulatory nucleic acid molecule

wherein the further reporter and further regulatory
nucleic acid molecules are arranged to permit the further
regulatory nucleic acid molecule to regulate transcription
of the further reporter nucleic acid molecule.

In one embodiment, the at least one further regulatory
nucleic acid molecule has a sequence shown in SEQ ID NO:4.
In another embodiment, the at least one further regulatory
nucleic acid molecule has a sequence shown in SEQ ID NO:5.

Although the regulatory nucleic acid molecule of the
invention described herein is sufficient for reproducing
the constitutive tissue specific and developmental
expression of the CYP3A4 gene that is observed in a human,
the inventors recognise that aspects of the xenobiotic
inducibility of the gene could be better reproduced in an
animal by incorporating at least one human transcription
factor that is capable of interacting with the regulatory
nucleic acid molecule for regulating transcription of the
human CYP3A4 gene. Examples of such factors are nuclear
receptors. These receptors may be those capable of
regulating CYP3A4 gene transcription in a human when the
receptor is bound to a ligand, such as a xenobiotic or
steroid. One example of such a receptor is the human PXR
(pregnane X receptor, otherwise known as SXR (steroid and
xenobiotic receptor)). Another suitable receptor is the
human CAR (constitutive androstane receptor-β). Non-human
animals comprising a human PXR or CAR receptor are known.
The inventors recognise that the genetic background of
these animals could be incorporated into the non-human
mammal of the present invention, for example, by
conventional breeding techniques.

Thus in a sixth aspect, the non-human animal of the
invention further comprises at least one human
transcription factor for regulating transcription of a
human CYP3A4 gene. Preferably the transcription factor is
a nuclear receptor. Preferably, the nuclear receptor is a
heterodimer of the human PXR (pregnane X receptor,
otherwise known as SXR (steroid and xenobiotic receptor))
and human RXR (9-cis retinoic acid receptor) or human CAR
(constitutive androstane receptor-β) and human RXR.


It follows that the reporter nucleic acid molecule can be
any molecule which is capable of detection when the
reporter nucleic acid molecule is transcribed. For
example, the reporter nucleic acid molecule could be the
CYP3A4 cytochrome, or the mRNA transcript which is
translated to produce the cytochrome. Those reporter
molecules which are commercially available, including
firefly luciferase, β-galactosidase, alkaline phosphatase,
green fluorescent protein or chloramphenicol acetyl
transferase can be used.

Thus in one embodiment, the reporter nucleic acid molecule
is capable of producing a reporter molecule selected from
the group of reporter molecules consisting of firefly
luciferase, β-galactosidase, alkaline phosphatase, green
fluorescent protein or chloramphenicol acetyl transferase.

While the non-human mammal of the invention, as
exemplified below, is a mouse, the inventors believe that
any other non-human mammal could be used in the invention,
especially those for which standard transgenic techniques
have been developed including, for example, rat and rabbit.
However, typically the non-human mammal is a mouse.

In another aspect, the invention provides a tissue of a
non-human mammal of the invention.

In one embodiment, the tissue is an embryo capable of
producing a non-human mammal of the invention.

In a further aspect, the invention provides a method of
determining whether a compound is capable of effecting the
transcription of a human CYP3A4 gene, the method comprising
the following steps:

(a) administering the compound to a non-human mammal
according to the invention; and

(b) determining whether the reporter molecule is produced
by the reporter nucleic acid molecule in the mammal.

In one embodiment, the production of the reporter molecule
indicates that the binding compound is capable of
effecting the transcription of the human CYP3A4 gene.



Any compound can be tested in the method; however,
preferred compounds are xenobiotic or steroid compounds.

The inventors recognise that a non-human animal which
comprises a 5' flanking region of the CYP3A4 gene but which is
deficient for the region from -7836 to -7207 would be
useful as a negative control in a method for determining
whether a compound is capable of regulating transcription
of the human CYP3A4 gene.



BRIEF DESCRIPTION OF THE FIGURES

Figure 1. CYP3A4/lacZ transgene constructs used to generate
transgenic mice. The upstream regions of the human CYP3A4
gene are depicted as open boxes with the position of the
XREM at approximately -7.5kb of the CYP3A4 gene indicated
by cross-hatching. The 5'-flanking region extended from
56bp downstream of the transcription initiation site to a
HindIII site at -3,213 in the construct designated
-3CYP3A4/lacZ and to a KpnI site at -12,926 in construct
-13CYP3A4/lacZ. The coding region of the E. coli lacZ gene
together with eukaryotic translational initiation and
termination signals, transcription termination and
polyadenylation sites are indicated by a solid box.


Figure 2. Xenobiotic induction of hepatic transgene
expression. Female mice from line 9/4 harbouring the
-13CYP3A4/lacZ transgene were treated with various reagents.
Histochemical staining of liver slices with X-gal revealed
an increased zone of blue staining cells containing
β-galactosidase after treatment with rifampicin,
phenobarbital and pregnenolone 16α-carbonitrile compared
with corn oil treated mice.

Figure 3. Comparison of the xenobiotic induction profile
of the -13CYP3A4/lacZ transgene with the mouse Cyp3a11
gene. Transgenic mice from line 9/4 were treated with a
range of xenobiotic reagents and naturally occurring
steroids. A. Transgene expression was assessed by
determining β-galactosidase activity in total liver lysates
using the ONPG assay. The units of β-galactosidase activity
are given as A420/mg liver/minute. Dexamethasone and
pregnenolone 16α-carbonitrile were the most potent
xenobiotic activators of the -13CYP3A4/lacZ transgene,
while rifampicin treatment resulted in relatively low
levels. The steroids pregnenolone and 17α-progesterone
were very weak inducers. B. Hepatic expression of the
endogenous mouse Cyp3a11 gene was examined in the same mice
by Northern analysis. A similar pattern of induction to
the CYP3A4/lacZ transgene was observed with both xenobiotic
and endogenous regulators. The data are presented as the
mean +/- the standard deviation for 3 animals.

Figure 4. Dose response of -13CYP3A4/lacZ transgene
expression after treatment with dexamethasone. A. Male
mice from line 9/4 were treated with from 1 to 100 mg/kg
dexamethasone. Higher doses of dexamethasone resulted in
increased β-galactosidase activity (determined in liver
lysates as described in Fig. 3). B. Zonal expansion of
transgene expression with increasing doses of
dexamethasone. X-gal staining of frozen liver sections
revealed greater numbers of hepatocytes containing
transgene-derived β-galactosidase activity after treatment
with 1, 10 and 100 mg/kg dexamethasone. At low doses there
are limited numbers of transgene-expressing cells
immediately adjacent to the central vein. With higher
doses there are more cells committed to transgene
expression extending across the liver lobule towards the
portal tract.

Figure 5. (SEQ ID NO:1) Sequence of the CYP3A4 5'-flanking
region included in the -13CYP3A4/lacZ construct. This
sequence corresponds to -12,926 to +56 base pairs relative
to the transcription initiation site of the CYP3A4 gene.

Figure 6. (SEQ ID NO:2) Sequence of the 5'-flanking region
of the CYP3A4 gene extending from -12,926 to -3,213 base
pairs and representing the difference in sequence between
the -13CYP3A4/lacZ and the -3CYP3A4/lacZ constructs.

Figure 7. (SEQ ID NO:3) The "Xenobiotic-Responsive
Enhancer Module" (XREM) of the human CYP3A4 gene. This
region encompasses -7836 to -7207 base pairs relative to
the transcription initiation site of the CYP3A4 gene.

Figure 8. (SEQ ID NO:4) The 5'-flanking region of the
human CYP3A7 gene (Genbank Accession No. AF329900). The
extent of the sequence is -11,133 to +52 base pairs relative to
the transcription initiation site of the CYP3A7 gene.

Figure 9. (SEQ ID NO:5) Sequence of the 5'-flanking region
of the human MDR1 gene (P-glycoprotein gene) encompassing
-10,000 to +200 base pairs relative to the transcription
initiation site of the MDR1 gene. Sequence derived from
within Genbank sequence Accession Number AC002457.

An embodiment of the invention is now described in the
following Example which will be understood to merely
exemplify and not to limit the scope of the invention.

EXAMPLE 

MATERIALS AND METHODS 

Transgene constructs. Two transgene constructs were
synthesized with the upstream 5' flank of the human
cytochrome P450 CYP3A4 gene linked to the E. coli lacZ
reporter gene (Figure 1). The first construct, designated
-3CYP3A4/lacZ, contained the region of the CYP3A4 gene from
the HindIII site at -3213bp relative to the transcription
start site to nucleotide +56bp downstream of the
transcription start site. The other construct, designated
-13CYP3A4/lacZ, included the region of the CYP3A4 gene from
the KpnI site at -12,926bp upstream to +56bp downstream of
the transcription start site. It includes the DNA
sequences of the XREM region located between -7836 and
-7208 in addition to the proximal promoter of the CYP3A4
gene. The DNA sequence of the CYP3A4 gene between -10468bp
and +906bp has been determined and deposited with the
GenBank/EMBL/DDBJ database under accession number AF185589.
Additional sequence information covering the region
-10,469bp to -12,926bp was obtained from publicly
accessible Genbank files. The E. coli lacZ reporter gene
comprises the coding region for the bacterial enzyme
β-galactosidase flanked by DNA sequences for eukaryotic
translational start and stop signals, SV40 transcriptional
termination and polyadenylation signals and an intron. The
CYP3A4/lacZ transgene constructs were released from vector
sequences and purified on agarose gels prior to
microinjection.
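
The construct coordinates quoted above can be checked against the XREM position with a few lines of code. The following Python sketch is illustrative only and is not part of the described method. The names `Segment`, `to_index`, `length` and `contains` are hypothetical, and the sketch assumes the common numbering convention in which +1 is the transcription start site and there is no position 0.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    """A stretch of the CYP3A4 5'-flank, numbered relative to the
    transcription start site (negative values are upstream)."""
    start: int
    end: int


def to_index(pos: int) -> int:
    # Assumption: +1 is the transcription start site and position 0 does
    # not exist, so positive positions are shifted down by one to give a
    # continuous index.
    if pos == 0:
        raise ValueError("position 0 is not used in this convention")
    return pos if pos < 0 else pos - 1


def length(seg: Segment) -> int:
    """Number of nucleotides in the segment, both ends inclusive."""
    return to_index(seg.end) - to_index(seg.start) + 1


def contains(outer: Segment, inner: Segment) -> bool:
    """True if `inner` lies entirely within `outer`."""
    return outer.start <= inner.start and inner.end <= outer.end


# Coordinates as quoted in the text.
MINUS_3 = Segment(-3213, 56)     # flank in -3CYP3A4/lacZ
MINUS_13 = Segment(-12926, 56)   # flank in -13CYP3A4/lacZ
XREM = Segment(-7836, -7208)     # XREM region

for name, seg in [("-3CYP3A4/lacZ", MINUS_3), ("-13CYP3A4/lacZ", MINUS_13)]:
    print(name, length(seg), "nt; spans XREM:", contains(seg, XREM))
```

Under these assumptions only the longer flank spans the XREM, which is consistent with the design of the shorter construct as one lacking that element.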

Generation of transgenic mouse lines. Mice carrying the
CYP3A4/lacZ transgenes were created by microinjection of
the DNA constructs into the pro-nuclei of zygotes harvested
from FVB/N strain mice. Microinjection and manipulation of
embryos were carried out by standard techniques. Stable
transgenic mouse lines were established by breeding from
transgenic founders identified by Southern analysis.

Administration of xenobiotics to mice. 8-10 week old male
and female mice hemizygous for the -3CYP3A4/lacZ and
-13CYP3A4/lacZ transgenes were used to test the ability of a
range of xenobiotics and hormones to activate expression of
transgene-derived β-galactosidase. Mice were administered
the following reagents and vehicles by single daily
intraperitoneal injection for 4 days: rifampicin/corn oil;
dexamethasone phosphate/H2O; pregnenolone
16α-carbonitrile/2% Tween 20 in H2O; phenobarbital/H2O;
clotrimazole/2% Tween 20; phenytoin/2% Tween 20; 17α-OH
progesterone/2% Tween 20; pregnenolone/2% Tween 20. All
reagents were supplied by Sigma Chemical Co. (St Louis, MO)
except for dexamethasone phosphate which was obtained from
Faulding (Mulgrave, Australia) and pregnenolone
16α-carbonitrile from Upjohn Co. (Kalamazoo, MI). The dose used
for all reagents to test for induction of the transgene was
100mg/kg body weight. Dose response studies were carried
out in the range of 1-100mg/kg with male hemizygous
transgenic mice.

Analysis of transgene and mouse Cyp3a gene expression.
β-Galactosidase activity was visualised in slices and frozen
sections of liver and other tissues by staining with X-gal
(5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside).
Tissues were fixed in 0.25% glutaraldehyde, 0.1M phosphate
buffer pH7.3, 5mM EGTA, 2mM MgCl2; washed in 0.1M
phosphate buffer pH7.3, 0.01% sodium deoxycholate, 0.025%
NP40, 2mM MgCl2; and stained by incubation at 37°C in wash
solution supplemented with 1mg/ml X-gal, 5mM potassium
ferricyanide, and 5mM potassium ferrocyanide. The level of
β-galactosidase activity was determined in whole liver
homogenates [100mg fresh tissue/ml 0.25M Tris-HCl (pH 7.3)]
using the o-nitrophenyl-β-D-galactopyranoside (ONPG) assay
according to standard techniques. After appropriate
dilution the homogenate was incubated with β-galactosidase
assay reagent (0.1M sodium phosphate buffer (pH7.3)/1mM
MgCl2/50 mmol β-mercaptoethanol/0.88mg/ml ONPG) at 37°C,
quenched by the addition of 1M Na2CO3 and the absorbance at
420nm determined. The units of β-galactosidase activity
are given as A420/mg liver/minute.
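
The unit quoted above can be made concrete with a short calculation. The sketch below is illustrative only. The text does not state how the dilution, assay volume or blank were handled, so the function name `onpg_activity`, its parameters and the blank subtraction are assumptions. Only the homogenate concentration of 100 mg fresh tissue/ml and the unit A420/mg liver/minute are taken from the text.

```python
def onpg_activity(a420: float,
                  minutes: float,
                  homogenate_volume_ml: float,
                  dilution_factor: float = 1.0,
                  blank_a420: float = 0.0,
                  tissue_mg_per_ml: float = 100.0) -> float:
    """Return beta-galactosidase activity as A420 / mg liver / minute.

    a420                 absorbance at 420 nm after quenching
    minutes              incubation time at 37 degrees C
    homogenate_volume_ml volume of (diluted) homogenate added to the assay
    dilution_factor      fold dilution applied to the homogenate (assumed)
    blank_a420           absorbance of a no-tissue blank (assumed)
    tissue_mg_per_ml     undiluted homogenate concentration (100 mg/ml
                         in the text)
    """
    if minutes <= 0 or homogenate_volume_ml <= 0 or dilution_factor <= 0:
        raise ValueError("time, volume and dilution must be positive")
    # Milligrams of liver actually present in the assay tube.
    tissue_mg = tissue_mg_per_ml * homogenate_volume_ml / dilution_factor
    return (a420 - blank_a420) / (tissue_mg * minutes)
```

For example, with hypothetical readings of A420 = 0.5 against a blank of 0.1, using 0.01 ml of a 10-fold diluted homogenate incubated for 20 minutes, the assay contains 0.1 mg of liver and the call `onpg_activity(0.5, 20, 0.01, dilution_factor=10, blank_a420=0.1)` returns approximately 0.2.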

The levels of endogenous mouse Cyp3a mRNA expression were
determined by Northern analysis using a riboprobe
complementary to nucleotides 852-1061 of the mouse Cyp3a11
cDNA. Filters were stripped and reprobed with an 18S rRNA
oligonucleotide to normalise loading.



WO 02/36784 



PCT/AU01/01407 






RESULTS 

4 transgenic lines were generated with the construct
containing the -3.2kb region of the human CYP3A4 gene
linked to lacZ. Transgene-derived β-galactosidase activity
was not detected in kidney, large and small intestine,
spleen, lung and liver tissue from mice for all 4
-3CYP3A4/lacZ transgenic lines treated with vehicle or
xenobiotics (Table 1). In contrast, transgene expression
was readily detected in 3 of the 4 lines carrying the
-13CYP3A4/lacZ construct. Line 9/4 had a very low
constitutive level in the liver, with β-galactosidase
detected only in isolated hepatocytes adjacent to major
blood vessels. Administration of xenobiotics resulted in
robust expression in a zone of cells surrounding the
central vein (Figure 2). As the basal level of transgene
expression in untreated mice in line 9/4 is extremely low,
induction is obvious and is essentially an off/on process.
Expression in other tissues in mice from line 9/4 was
restricted to the gut, predominantly in the villi of the
small intestine.

The relative degree of induction for a range of xenobiotics
was analysed by determining the transgenic β-galactosidase
activity in liver lysates of mice from line 9/4 (Figure
3A). Dexamethasone and pregnenolone 16α-carbonitrile were
the most potent inducers, while rifampicin activated the
transgene to relatively modest levels. Phenobarbital,
clotrimazole and phenytoin were intermediate inducers. The
induction profile of the transgene in line 9/4 was similar
to that observed for the endogenous Cyp3a11 gene in the
same mice (Fig 3B), likely reflecting the activation
profile of the mouse rather than the human PXR. Activation
of the transgene was observed with naturally occurring
steroids such as pregnenolone and 17α-progesterone; however,
the induction was weak compared with xenobiotics.

There was a marked gender difference in hepatic transgene
expression, with lower levels observed in females than in
males for most reagents. Such a male-predominant pattern
was not evident in the induction profile of the mouse
Cyp3a11 gene. Indeed, higher levels of Cyp3a11 mRNA were
observed in females than males after treatment with
rifampicin and pregnenolone 16α-carbonitrile. The reason
for this apparent reversal in gender-related transgene
expression pattern is not known. However, as Cyp3a11 mRNA
is only just detectable in males of the FVB/N strain of
mice, it may be attributed to the relatively greater degree
of induction of the mouse Cyp3a11 gene in males compared to
females (Figure 3B).

The other line which showed significant transgene
expression, 15/10, had a higher constitutive level in both
the liver and small intestine in untreated mice.
Expression was not detected in other organs, confirming the
tissue specificity observed in line 9/4. The same set of
reagents was capable of increasing hepatic and intestinal
transgene expression to the same levels as in mice from
line 9/4. However, the overall degree of induction was not
as great as observed in line 9/4 due to the higher basal
level in line 15/10. The induction profile was similar,
with dexamethasone being the most potent activator and
rifampicin the least (data not shown).
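
The effect of basal expression on the apparent degree of
induction can be illustrated with a short calculation. The
sketch below is a minimal illustration only, assuming that the
degree of induction is taken as the ratio of induced to basal
β-galactosidase activity. The function name `fold_induction`
and all activity values are hypothetical and are not
measurements from these mice.

```python
def fold_induction(basal: float, induced: float) -> float:
    """Return the ratio of induced to basal reporter activity."""
    if basal <= 0:
        raise ValueError("basal activity must be positive")
    return induced / basal


# Hypothetical activities in arbitrary units (NOT data from the study).
# Both lines reach the same induced level, but line 15/10 starts from
# a higher basal level than line 9/4.
activities = {
    "9/4": {"basal": 1.0, "induced": 100.0},
    "15/10": {"basal": 10.0, "induced": 100.0},
}

fold = {
    line: fold_induction(v["basal"], v["induced"])
    for line, v in activities.items()
}

for line, value in fold.items():
    print(f"line {line}: {value:.0f}-fold induction")
```

With equal induced levels, the line with the higher basal
activity necessarily shows the smaller ratio, which is the
relationship described above for lines 9/4 and 15/10.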

Dose response of xenobiotic induction. The activation of
transgene expression in line 9/4 by dexamethasone was
dose-dependent over the range 1 to 100 mg/kg (Figure 4A). The
higher transgene-derived β-galactosidase activity in liver
homogenates from mice treated with increasing doses of
dexamethasone was associated with an expanded zone of cells
which were stained by X-gal. At low doses of dexamethasone
a ring of hepatocytes only 1-2 cells thick around the
central vein expressed the transgene (Figure 4B). With
100 mg/kg dexamethasone the zone of X-gal positive
hepatocytes increased to up to 10 cells, approximately
midway between the central vein and the portal triad. A
similar dose-dependent expansion of hepatocytes expressing
the transgene was observed with other reagents and also in
line 15/10, which also contained the -13CYP3A4/lacZ
construct.



WO 02/36784 



PCT/AU01/01407 




CLAIMS 

1. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule which is
capable of regulating transcription of the human
CYP3A4 gene and which comprises a nucleotide sequence
that is identical to a sequence of the human CYP3A4
gene located between the initiation of transcription
site of the gene and a position located at least
13,000 nucleotides upstream from the site; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

2. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the
nucleotide sequence of the human CYP3A4 gene that
extends about 13,000 nucleotides upstream from the
initiation of transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

3. A mammal according to claim 2 wherein the regulatory
nucleic acid molecule comprises the sequence shown in
SEQ ID NO:1.

4. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule comprising a
nucleotide sequence that is identical to the sequence
of the human CYP3A4 gene that extends about 8,000
nucleotides upstream from a position about 3,000
nucleotides upstream from the initiation of
transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

5. A mammal according to claim 4 wherein the regulatory
nucleic acid molecule comprises the sequence shown in
SEQ ID NO:2.

6. A non-human mammal comprising:

(a) a regulatory nucleic acid molecule which is
capable of regulating transcription of the human
CYP3A4 gene and which comprises a nucleotide sequence
that is identical to the sequence of the human CYP3A4
gene that extends about 600 nucleotides upstream from
a position about 7,200 nucleotides upstream of the
initiation of transcription site of the gene; and

(b) a reporter nucleic acid molecule for producing
a detectable amount of a reporter molecule for
indicating regulation of transcription of the
reporter nucleic acid molecule by the regulatory
nucleic acid molecule;

wherein the reporter and regulatory nucleic acid
molecules are arranged to permit the regulatory
nucleic acid molecule to regulate transcription of
the reporter nucleic acid molecule.

7. A mammal according to claim 6 wherein the regulatory
nucleic acid molecule comprises the sequence shown in
SEQ ID NO:3.

8. A mammal according to any one of the preceding claims
wherein the regulatory nucleic acid molecule has the
sequence of a fragment of the CYP3A4 gene consisting
of nucleotide positions -7836 to -7207
contiguous with -362 to +53.

9. A mammal according to any one of the preceding
claims, further comprising:

(c) a further regulatory nucleic acid molecule
which is capable of regulating transcription of a
human gene; and

(d) a further reporter nucleic acid molecule for
producing a detectable amount of a further reporter
molecule for indicating regulation of transcription
of the further reporter nucleic acid molecule by the
further regulatory nucleic acid molecule;

wherein the further reporter and further regulatory
nucleic acid molecules are arranged to permit the
further regulatory nucleic acid molecule to regulate
transcription of the further reporter nucleic acid
molecule.

10. A mammal according to claim 9 wherein the at least
one further regulatory nucleic acid molecule has a
sequence shown in SEQ ID NO:4.

11. A mammal according to claim 9 wherein the at least
one further regulatory nucleic acid molecule has a
sequence shown in SEQ ID NO:5.

12. A mammal according to any one of the preceding
claims, further comprising at least one human
transcription factor for regulating transcription of
a human CYP3A4 gene.

13. A mammal according to claim 12 wherein the
transcription factor is a nuclear receptor.

14. A mammal according to claim 13 wherein the nuclear
receptor is a heterodimer of the human pregnane X
receptor and human 9-cis retinoic acid receptor or a
heterodimer of human constitutive androstane
receptor-β and human 9-cis retinoic acid receptor.

15. A mammal according to any one of the preceding claims
wherein the reporter nucleic acid molecule is capable
of producing a reporter molecule selected from the
group of reporter molecules consisting of firefly
luciferase, β-galactosidase, alkaline phosphatase,
green fluorescent protein or chloramphenicol acetyl
transferase.

16. A mammal according to any one of the preceding claims
wherein the mammal is a mouse.

17. A tissue of a mammal according to any one of the
preceding claims.

18. A tissue according to claim 17 wherein the tissue is
an embryo capable of producing a mammal according to
any one of the preceding claims.

19. A method of determining whether a compound is capable
of effecting the transcription of a human CYP3A4 gene,
the method comprising the following steps:

(a) administering the compound to a non-human
mammal according to any one of the preceding
claims; and

(b) determining whether the reporter molecule is
produced by the reporter nucleic acid molecule
in the mammal.

20. A method according to claim 19 wherein the production
of the reporter molecule indicates that the binding
compound is capable of effecting the transcription of
the human CYP3A4 gene.

Fig 1. Human CYP3A4/lacZ transgene constructs

[Figure: schematic of the transgene constructs. Recoverable
labels: CYP3A4 upstream region; lacZ; -13CYP3A4/lacZ;
positions -13kb, -7.8, -7.2kb and +56bp.]



SUBSTITUTE SHEET (RULE 26) 














Fig 3B Xenobiotic induction of mouse CYP3A11 mRNA 
Figure 5.

CTGGTTCATCTGATT^ 

GCX3TCGCCIXACCTGGGAAGCAC1AAGG 

TGAGCCTGAGGAACTGTGCACrrCIt^CCCAG 

GGAGATTCCCTCCGGTGCCTATGCC^CCAGGGCCCrGGGT^ 

AGACACCGAACTAGCIX3CAGGAGTTTTTTT^ 

GACAGAACCXnTCACTCCCCTGGAAAGGGGGCTGAAACCAG 

ACCCCCATGGAGCCC^GCAAACAAAGATTC^ 

AGATTGACCTGGGACCCTCXSAACTTGGTTGGGTGC^^ 

GGTGGTTTTACXrTTCXSCX^TGTAAACA^ 

GTAAGGCC^CTGTGGCCAGACTGCCrCTCTGGA 

AGCAGCCCCAGTCAGGGACT^ATAGATGAAArc 

GCTTCCACX!ATTGTGGAAGACTGTGTGGCAA^ 

CAATCX^GATTACTGGGTGTATACCCATAGGAT^ 

TTTATTGTAACACTATTTACAATAGCAAT^ 

AAAGAAAATGTGGCACATATACACTGTGGAATA 

CAGAGACATGGATGAAGCTCGAAACCATCA 

ATGTTGTCACTCATAAGTGGGAGTTAAACAATGAGA^ 

GGGCATGTCX?GGGA6TGGGGGCXr£AGGGGAGGG 

GATGGGTGCAGCAAACCACCATGGCACATATACACXn^A 

CAGAACTTAAAGTATAATTAATAATAATAATAATT^ 

TTGATACATATTCTGAGAGAATTAAACCT^^ 

C<Xrr^TAATGTGTTTTGGGGTAAGCCT 

CTTCCTGAAGTTCATGTGCAAAGTTG^ 

AGGAAAATATTAGATATGTGTCATCACTGCT 

TCTCAGCCACCATGCCTG CATTTTATCTCTGTCTCGTGGTCTGC^^ 

TAGAATCCTGGGCATCAAGAAC^TGTGGTTC 

ACGGTCACXnXjAGTAAACAACAGATTTGGAG 

TCCAGTTGTCCrrTTCATTTC 

TGCCTTCTCCAATGCTCCCTTCC^ 

TTCAATTATTTGTTCAGTAAAACITGAACT 

TGTCGGGTTACTCTATAACTCTTGTCCTGT^ 

ACGTTTATGGAAGGACTGCCAAGAGTCAGGTACTAGG(^^ 

CAACACCAGCCAGACTTGCAiGATCTAGGTCC 

TGAGCTCCTTTGGGAGGGAAAGACAGAATTATTAACA 

AGTTTAAATAACTC^GACAGGGCCro^ 

ACAGAACGAGATGTCTCTGGAGTCCATAQAAAGCCCAG 

ATCCXIAGCACTTTGGGAGGCCGAGGCAGGCAGATC^ 

ATGGCAAAACCCCATCTCTACTAAAAATACA^ 
GCTACITGGGAGGCIXSftGGCACavAGAATTC
GCX3VGTGCACTCCAGCOT 

ACAAAAATAGAAAGCCCAGGGACCACCTGCGTCAG^ 
TCTGGCATCTTCTCACAGGTTCCrA^ 
5 TCAC^CCCACTCAGATGGCrrATTTT^ 
GGAATTCTCACCCATTGCIX3GTC 

CAAAAAGTTCAACAGAATTTCAATGTGACCCAGCAA^ 

ACAGTTACTAAAATACACXX^CriXZACATATTTCT . 
CCCACATGTCCATCGATGGATGGATAAATAAATTGTGGT^ 
10 AATGGAATGAAGTACTGACGCATGCTAOVGAAT^^ 
GAATGGATAGCXrK^CTTTACTATCAAGTGAAGG 
TCCAGAAGAGGGAAGCCXIACAGAGACAGAATC 
GGGGAGAAACTGCTCAACTGGTACAGGCTTTA 
AGTGATTGCAGAACACAGAATGTACTGAATTCCACT 
15 CTGAGATTGGAT7UVTCATAAAAAAATGGTTAATOT 
ACCTCAGAAATATACACTAGCAGGCATGGA^ 
GGGGTCCXXXSGCCAGCCrDtf^ 
TCCACATGCTGGCATGATTC^ 
CACTGTACCTTGCAACCAAAAAATG^ 
20 CAGTTAAATTATATTTATTATAGCTTGACATG^ 
GACATGAACATTTTATCAAAGAAGAAAT^ 
AAGTGGCTGGGCTCAGTGGTTCATC 
TVAGCCAGAAGTTCAAGACCATCCIA^ 
AATTAGCCAGGCATGGTGATGCATGC^^ 
25 AGCCTGGGAGGCTGCXjGTGGCGGTGAGCCATC 
CTGTCTAAATAACAAAAATAATAGTAAAGAAAA 
AGTTAAAAGTAATTTCTACGTTTTCT 
ATGCTTCTCACTTTCCCTTCTTCl^ 
GACTTCACCCCAGGTAGTTCXIA^ 
30 CTCCGTGTGGTGAAOTACCCTACCTAGGtf 
(XrTTATGGTTCroTTGCCXTGTGTTGTC 
| GAGGGCCCCATGGTCCTGGAGATAACCATGTAACCGA 
TGGGCTGCACTTGGGTGGGCAGGTC 
TCTGGGGTCCCCCTTGGAACriTCA^ 
35 TTTCATTTGATTATCAAAGAAACTCATGT^ 
TCTCAGCTGAATGAACTTGCTGAC^ 
CAAGCAGCCTrCATGAGGCATTACAAAGT^ 
CAGGTGAATCACAAGCTGAACTTCTGACAAG^ 
GTTAATTGATAGGATTTTTTTTGTTTAAAAAAA^ 
40 AAACACTAAGAAGGTATAAGAGCATAATGCTTTTATA 
GGTAGATGCTTTCATCAGATTAAGAAAATTCCCTGOT

AGTTGAATATTATTATCATATATTATTAATATATTGTTATTGAACT 

GAGATGATCTTATAACC^TTCTCCrTTAACCrc 

ACCATTCTTGAGTCTCAGGTTTAAATTCAACrr^^ 

CTTGCTACTGTTTTGTTTAGGATTTTTC 

AGTCCTGATTTTTTTCrGATTTGG 

TTTTCATGTClXnXjGATTGATGGGACACTTTGG 

ATTGTTAGGCCCCTXSGGCAAGCCCrrTCrCA 

GTTTGTTTCTTGAGTCGGAGTCIX^C^ 

TCACTGCAACCnXX!ACCTCCCAGATTCAA 

GCACCCACX^ACACACXrOTGCTAATTTOT 

GGTCTCGAACTCCTGATCTCATGATCTGCra 

ACCACACCCAGTGAACCrrcTGGTTTTTAGAAGCTC 

CAGCCACTGTGTGTTCAGCTGTTGGAACTC 

GAGTTCATGGAAGAGGCTTGTTGGGGAGATGATC 

AGATGGAAGCTGGGTTGGACXX^CTCTATGC^^ 

TCATTTTTGTAGAGCCTGAGGAGTGTCX^ 

GGACTGAGACCGAATGAAGAAGCTAACAGTGGC^ 

TTAGATTTTATCCTGTGGGTGATAGGAGC^G^ 

TCTATGTTTTTC^^GACXK!TTTT^ 

GCAAAACAAGTTAGGGGGTTACTGAAATAAGGAGTA 

CACATCTCATGATGATCTTCAT^^^ 

GAAATTTGCerCACATTCTCIX3A 

GGAAACACAAATGATTCTGATTGTGACATGTCAGAA 

AGTTCACTGAATACACAATTTAGTAAAGTGTAGG^ 

AGCATTTTATAACTACACTGGCAGT^^ 

ACAACCTGCCAAGTTATTTTAGGAGCX?^^ 

ACAGCATCCTGCTCCMGCCTTC^ 

CX^TGTCTATAAAGTATGAGTTAAATCATCCT 

GAGAGTTGGGAAAGAACTGTATAAACTGTGAAGTGC^ 

GACACTCTAGAATGGAGTAGAAAGCCAACAGTTATGATO 

TATAAAGAAAAGAGGTTTAATTGACrcACAGTTC<^ 

AGCAGGAGGCAAAGGGGAAGAAGGCACCTTCrrrC^ 

TGTCATAAACnXH'ACAGAAGTGCTTATACTTC^ 

ACXXKIAGC<XrrCACTGACGAGTTTGCTAGG 

TACCCATTCAGAAGGCCTGGGCTGCACTGCT 

CCTGTGTTAACAGATTATGACXXritXX^ 

GCAATTAGTGATGTAAACTTTAATGGGTCAGTTAA 

GGATCATAACAGTATCAATACCA(^TGATTGTTGGAC^^ 

CATGACACGTATTCACTATCATTTCCTGGAG 
TTTTCTCTGCAATCrCAGTTAAGAAA

crccca^GTTCcjmxrrATCxxA 

GGAAGACCTGGTGGACCCTAAGTGATCTATAAGAGG 

AGGGACACAGAGGCTGAGAGGCAGGC^GTCC^ 

CACAGGCCAATTCTGGTCAGTTTC 

CACTTCCCATCTGTCAAGTCTTGTTGATTCTGT^ 

AGAAAGAAGGATCAACTGTGTCTGATACX^OT^ 

ATCXZCAATTTGTAATTCAGTATTATTTGTGTGGC^^ 

CATAAGGGCAAGGATTTTGATTCTTTAA ' 

GCTAACXIAACATACAGTGGCATGAAAGTCAT^ 

AA<X^TGCACCTGTCACTCTC 

CTCOVGCXACTAAAACTTAAAGGCCAGGTC 

AGCAGGAGGATCACTTGAGGCCAGGAGTTTGM 

AAAATTAGCTGGGCGTTGTTGCATGCCTGTAGTCCC^ 

AGCCCAGGAGGTGGAAACAA(^GTAAGCTATAATC^ 

CCCTGCCTCAAAACAATTTTAAAA^ 

AAATCIAAGTTTTCXXXniACTGAGAAGAATGGGGACT 

CrrriUnUVX"XU"l"l , GGTTTACyiTCCTC 

CAGAGCAATGTTGCCTCCGCATACCAT^ 

J\GAGAGG1ACX^jOTTGGGCTACCJ^ 

GAGTCXX!TGGACAGC<nXnXX^ 

TTTGAlKXIAATGAAACrrAGAACXSAGAAA 

TACTTGTATGATTATTTTTCCT^^ 

GGTACAAAGTTQACCAAGACCAACTTTGG^^ 

TTGAGAGCACAGCTTTAAAAAACATGATGAAGGCTTT^ 
GGTTGGATGTATCTTCCTTGACACAGT^ 

TACCTTCTGTGGAATTAAGTCGCAGAACrrT^ 

TAGTTAGGAOVCAGTGCTGGCrra 

AAACCAATCAGACCACXnXXrrAAAGTTTGTAG^ 

CAAGTATGTGTTTTAGAGGCAAGAAGAA^CATATTTC 

GAAAGGCTCTCX^TTTAGGTGAGGTAATGAAGTTGTTC 

TCTGAAOTTGTTCTGTGCTm^TCTCAGTC 

TGGTCT^GAAACTGCATCTTTCTTT^ 

TGGTGCCGCTCGGAGTAACCCT^ 

TCACTATTTCCCATAGTGATOCGTAG^ 

CCTTCCTGTTAGCi w lTT TA T 

CCTCAGTTTCTACATCTGTAAACTGGG 

AAAGGAQACAGAAATGTATCTAAATTACC^TGGACTTTra 

<XX!ATGTTTGTCX!TTTTTGGTTTGAT^ 

TTATCCCAGGGATTCCAGAJGTT^ 
ACTAGGCTGTCCCCAAGGAACmTGT^

ACTTGGCCTTGAATCTGACAAGGAATCTXKjC^ 

GAGGCTTAGGGGCGAGAGGGATGGGAAAAGGTGTCTT^ 

AATGTGAGAAAGAATAAAGAGAGAAGGAAGCGAACAAGCACAGC^ 
5 TTAAGAACGCTGTCATTTTGTTTGTCATC 

AGTGACGATAATGAGCTOATATAATTTTC 

GGAGCTCACAGGTGGGAGCAACTGCTATTAC^ 

GATTAGAAAATCCrCGACTTGTCAAAA^ 

GGGACC^TCITACTGGAGTCCTTGATGCT 
10 GTGGTTGGGGTCCATCTGGCTATCTGGGCAGCTG 

AGTATTTCCAGAGAGAAGGGGCCACTCTTTGGCAA^ 

TGAAGGGTTCACAGGAAGCAGCACAAATTGATAC^^ 

CTGTCTItntXrrTTAGGGGTCCXX^ 

CTGAGGCACAGCCAAGAGCTCTGGCTGT^ 
15 TGCAGAGGCCCAGCAATCTCAGCT^ 

CX^TGGTAGGGACCAGAGCCATGACAGGGAATAAGACT^ 

GGAAACAGGCXTIXSGAAACACAATGGTGG^^ 

GGTGCCCAGGGGAGGAAATGGTTACATCTGTO 

GTCTGTCTGGGTTTGGAAGQATGTGTAGGAGTCT 
20 ATCTCTAGGTGTGGCrrTGTTGGGATGAAT^ 

AGAGAGAGGCX3ATTTAATAGATTTTATGCCAATOT 

GGACTCXICCAGTAACATTGATTGAGTTGT^ 

GTGGTCTK3TCTX3TGATTCT 

TGGCCCrcCTACTGGCTGCTVGCTC^ 
25 GAATCACTGCTOTGCAGGGCAGGAAAGCT^ 
Figure 6.

CTGGTTCATCTCATTGGGAC^ 

GCX3TCGCCTCACCriX3GGAAGCaCAAGGGGTC^ 

TGAGCCTGAGGAACTGTGCACTCTGGCCCAGA^^ 

GGAGATTCCXTTCCGGTGCCTATGCCACCAGG<^ 

AGACACCXSAACTAGCTGCAGGAGTTTTTT^^ 

GACAGAACCX3TTCACTCC(XrrcX3A/^^ 

ACCCCCATGGAGCCCAGCAAACAAAGATTCACTT^ 

AGATTGACCTGGGACCCTCGAACriTGGT^ 

GGTGGTTTTACCTTttXX^^ 

GTAAGGCCACTGTGGCCAGACTGCCTCTC^^ 

AGCAGCC(X!AGTCAGGGACTTATAGATGAAACCC^ 

GCTTCCACXATTGTGGAAGACTGTC 

CAATCCCATTACTGGGTGTATACXX^TAG 

TTTATTGTAA(^CTATTTACAATAGCAA 

AAAGAAAATGTGGCACATATACACTGTGGAATACT^ 

CAGAGACATGGATGAAGCTGGAAACCATCATTC^ 

ATGTTGTCACTC^TAAGTGGGAGT^ 

GGGCATGTCGGGGAGTGGGGGCCTACGGGS^GGGATAG 

GATGGGTGCAGCAAACCACCATGGCACATAT^ 

OttSAACTTAAAGTATAATTAATAAT^ 

TTGATACSITATTCTGAGAGAATTAAACC^ 

CCKTTAATAATGTGTTTTGGGGTAAGCH^ACTCATATO 

CTTCCTGAAGTTCATGTGCAAAGTTGAGT^ 

AGGAAAATATTAGATATGTGTCATCACroCTGGTATT^ 

TAGAATCCTGGGCATCAAGAACATGTGGTC 

ACX3GTCACX3TGAGTAAACAACAGATTTGGAGGG 

TCCAGTTGTCCTTTCATTTCCTCC^ 

TGCCTTCTCXAATGCTOXri^ 

TTCAATTATTTGTTCAGTAAAACTTGAAOTCATC 

TGTCGGGTTACTC^ATAACTCTTGTCCKn^ 

ACGTTTATGGAAGGACTGCCAAGAGTCA(^ 

CAACACCAGCCAGACTTGCAGATCTAG^ 

TGAGCTCCTTTGGGAGGGAAAGACAGAA 

AGTTTAAATAACTCAGACAGGGCCTGCACGTC^ 

ACAOAACGAGATGTCTCTGGAGTCCATAGAAM 

ATCX!CAGCACITTGGGAGGCCX3AGGCAG 

ATGGCAAAACCCCATCTCTACTAAAAATACAAAAAATTAGC 

GCTACTTGGGAGGCTGAGGCACAAGAATTGCTTGAG 
GCCAGTGCACIXXIAGCCTG^

ACAAAAATAGAAAGCCCAGGGACCACCTGC^ 

TXnXKXATCTTCTCACAGGTTCCT 

TCACACCCACTCAGATGGCTATT^ 

GGJ\ATTCTCACCCATTGCKK3TGAGAATGT 

CAAAAAGTTCAACAGAATTTCAATGTGA 

ACAGTTACTAAAATACACXK3ACTCACATAT^ 

CCCACATGTCX!ATCX3ATGGATGGATAAAXAAATTGTGGT^ 

AATGGAATGAAGTACTGAOX^TGCTACAC^ 

GAATGGATAGCCTCACTTTACTATGAA^ 

TCCAGAAGAGGGAAGCCH^CTVGAGACMSAATGTC 

(XXSGAGAAACTGCTCAACTGGTAC^ 

AGTCATTOC^GAACACAGAATGTACTGAATT^ 

CTGAGATTGGATAATCATAAAAAAATGGTTAATT^ 

ACCIXIAGAAATATACACTAGCAGGCM 

GGGGTCCCCGGCCAGCCTDU^^ 

TCCACATGCTGGCATGATTCCTGGCC^ 

CACTCTAC^TITGCAACX^AAAAAATC 

CAGTTAAATTATATTTATTATAGCTTGA 

GACATGAACArrTTATCAAAGAAGAAATGCAGC^ 

AAGTGGCTGGGCTCAGTGGTTCATGCCTCT 

AAGCCAGAAGTTCAAGACCATCCTAGGCAAGTC^^ 

AATTAGCCAGGCATGGTGATGCAT6CCTO 

AGCCIXSGGAGGCTGCGGTGGCGGTGAGCCATGAT^ 

CTGTCTAAATAACAAAAATAATAGTAAA^ 

AGTTAAAAGTAATTTCTACGTTTTCTA l^ 

ATGCITCTCACrrTTCCe^^ 

GACTTCACCCCAGGTAGTTCX^AGATCCCCriT 

CTXXXjTGTGGTGAAGTACCCTACCTAGGGTTTC^ 

CerTATGGTTCTGTTGCCCTC^^ 

GAGGGCCCCATGGTCCTGGAGATAACCATGTAAC 

TGGGCTGC^CITGGGTGGGCAGGTCCCCTGGA^ 

TCTGGGGTCCCCCTTGGAACTTCATGCAAA^ 

TTTCATTTGATTATCAAAGAAACTCATGTCCC^ 

TCTCAGCnSAATGAACTTGCTGAC^ 

CAAGCAGCCTCATGAGGCATTACAAAGTTTAATT^^ 

CAGGTGAATCACAAGCTGAACTTCTG^ 

GTTAATTGATAGGATTTTTTTTGTTTAAAAAA^ 

AAACACTAAGAAGGTATAAGAGCATAATGCTTTTATACrrACrrA 

GGTAGATGCTTTCATCAGATTAAGAAAATTC 
AGTTGAATATTATTATCATATATTATTAATATATTC

GAGATGATCTTATAACCATTCTCCTT^ 

ACCATTCTTGAGTCTCAGGTTTAAATTCAAC^^ 

CTTGCTACntnTTTGTTTAGGATTTTTG 

AGTCCTGATTTTTTTCTGATTTGGATCATG^ 

TTTTCATGTCTCTGGATTGATGGGAC^^ 

ATTGTTAGGCCCCnXjGGCAAGCCCTTCTC^ 

GTTTGTTTCTTGAGTCGGAGTCTC^ 

TCACTGCAACCTCCACX!TCCCAGATTCAA^ 

GC2\ccc^cxx^cAC^cx:crrc 

GGTCTCGAACTCCTGATCTCATGATCTC^ 

ACCACACCCAGTGAACCTGTGGTT^ 

CAGC(^CIXnXnX3TTCAGCTGlTGGAACTC 

GAGTTCATGGAAGAGGCTTGTTGGGGAGATGATC 

AGATGGAAGCrTGGGTTGGACCCACTCTA^ 

TCATTTTTGTAGAGCCTXSAGGAGT^^ 

GGACTGAGACCX3AATGAAGAAGCTAACAGTGG<X!AGAT^ 

TTAGATTTTATCCTGTGGGTGATAGGAGCA^ 

TCTATGTTTTTCAAGACGCTTT^ 

GCAAAAC^AGTTAGGGGGTTACTGAAATAAGGAG^ 

CACATCTCATGATGATCTK^ 

GAAATTTGCCTCACATTCHXITC^ 

GGAAACACAAATGATTCTGATTGTGACATGTCAGAAT^ 

AGTTCACTGAATACACAATTIAGT^ 

AGCATTTTATAACTACACTGGCAGTG^^ 

ACAACCrTGCCAAGTTATTTTAGGAGCC^^ 

ACAGCATCCTGCTCC^TGCCTTC 

CCATGTCTATAAAGTATGAGTTA^ATCATCCTAAC^ 

GAGACTTGGGAAAGAACTGTATAAACTGTC 

GA<^CTCTAGAATGGAGTAGAAAGC^ 

TATAAAGAAAAGAGGTTTAATTGACTCACAGTTCCA 

AGCAGGAGGCAAAGGGGAAGAAGGCACXTTTOTTC^ 

TGFTCATAAAGTCTAGAGAAGTGCTT^ 

ACCCCAGCCXTIX^CreACGAGTTTG^ 

TACXXIATTCAGAAGGCCTGGGCTGCACTGCTAGT^ 

CXJTGTGTTAAC^GATTATGACCCTGGGCTCTCA 

GCAATTAGTGATGTAAACTTTAATGGGTCAGTTA 

GGATCATAACJIGTATCAATACCACATGATTGTTGG^ 

CATGAC^CXCTATTCACTATCATT^ 

TTTTCTCTGCAATCTCAGTTAAGAAACCAATC 
CTCCCAGTTCCATCCTATCC^
GGAAGACXrTGGTGGACCCTAAGTGAT^ 
AGGGACACAGAGGCTGAGAGGCAGGCAGTCCTGC^ 
CACAGGCCAATTCTGGTCAGTTTCXX:(X^ 
5 CACTTCX!CATCTGTCAAGTCTTGT^ 

AGAAAGAAGGATCAACTGTGTCTGATArc 

ATCCCAATTTGTAATTCAGTATTATTT^^ 

CATAAGGGCAAGGATTTTGATTCTTTAATAT^^ 

GCTAACXIAACATACAGTGGCATGAAAGTCATGAAAGTGAGACACCT^ 
1 0 AACC^TGCACCIX5TCACrr(nx:CTCAACAC 

CTCCAGCCACTAAAACTTAAAGGCCAGGTGT^ 

AGC^GGAGGATCACTTGAGGCCAGGAGTTTGA 

AAAATTAGCTGGGCGTTGTTGCATGCCrrGTAGTC 

AGCCCAGGAGGTGGAAACAACAGTAAGCTATAATC^ 
15 CCCTGCCTCAAAACAATTTTAAAAATAAAT^ 

AAATCAAGTTTTCC(XrrACTGAGAAGAATGGGGAC^ 

CTTTTTTTXTTTTTGGTTTACATC 

CAGAGCAATGTTGCCTCCGCATACCATC 

20 
Figure 7.

TCTAGAGAGA TGGTTCATTC CTTTCATTTG ATTATCAAAG AAACTCATGT CCCAATTAAA 
GGTCATAAAG CCCAGTTTGT AAACTGAGAT GATCTCAGCT GAATGAACTT GCTGACCCTC 
TGCTTTCCTC CAGCCTCTCG GTGCCCTTGA AATCATGTCG GTTCAAGCAG CCTCATGAGG 
CATTACAAAG TTTAATTATT TCAGTGATTA TTAAACCTTG TCCTGTGTTG ACCCCAGGTG
AATCACAAGC TGAACTTCTG ACAAGAACAA GCTATCATAT TCTTTTCAAT TACAGAAAAA 
AGTAAGTTAA TTGATAGGAT TTTTTTTGTT TAAAAAAAAT GTTACTAGTT TTTGAAAAGG 
TAATATGTTG CACATGGTAA ACACTAAGAA GGTATAAGAG CATAATGCTT TTATACTACT 
AAGAATAATG TTTTCTCTAA GTTTTTTTTG GTAGATGCTT TCATCAGATT AAGAAAATTC 
CCTGCTATTA GTTGTTGAAG GTTTTTATAT CATAAATGAA AGTTGAATAT TATTATCATA
TATTATTAAT ATATTGTTAT TGAACTATCA AAGCCTTTTC CTAAAACCAT TGAGATGATC 
TTATAACCAT TCTCCTTTAA CCTGTTGACG AG 
Figure 8.

GGATCCAGTTTCAGCITTCTA 
TCCCCATTGCTTGTTTTTGTCAGG 
5 CCTCTGTTOTGTTCCATTGGTCCATATC 
TTGTAGTATAGTOT 
AATGTGGGCTeiA-iriu^^ 

TGGTAACTTOATGGGGATGGCATTGAATCrATAAAT^ 
TTCITCCTATCCATGAGCATGGAAC^ 
10 TT6TAGTTCTCCTTGAAGAAGT(X!TT^ 

TTTTGCGCATTAMTT^ 

GACGATGGGGTTTTCTAAATATACTUVTCATGGCAl^ 

ATTGAATACCCTTTATTTCTTTT^^ 
1 5 AGTCA^ 

TGGTTTTGCCATAAATAGCTCTITATTATTTT^ 

AGCAGGAAGGGCTGTTGAATTTTGTCGAAGGCC^^ 

TCXjTTGATTCTGTTTATGTGATGGATTAC^ 

GATGAAGCCCACrrTGATTGTAGTGGATAAGCTT^^ 
20 TGAGGATTTTGGCATCAATGTTCATCAGGGATAT^ 

GGCrTT TGGT ATCAGGATGATGCAGG 

GAATAGTTTC^^ 

GTCCTGGACTTTTTGGTTGGTAGGCTAOT 

ACATTCAACTTCTTCCCXXTT^^ 

25 TTTCTAGTTTATTTGTGTAGAGGTGTTT 

GTGATATCCCCITTATCATTT^ 
TCGCX3GTCTG ^ 

TTTTGTGTCTCTATTT^^ 

TGTTTGCTCTTCCTTCTCTAij 
30 TCTTGTGGTCATTTAGTGCTATAAATTTCC^ 

GTTGTGTCTTTGTTCTCATTCATTT^ 

TCATTCTVGGAGCAGGTTGTTCAGTCTT^ 

TAATTTGATTGCACTGTTGTCI^^ 

TTTATGTCCCATTATGTGGTCAATTTTAGAATAAGT^ 
35 ATTTGGGGTGTCGJVGOTCT^^ 

TATCAAAGAAGAAACACAGCTATC^tf^^ 

CTGGTTCATTCCTGTAATCrcAGTCCT 

GACXATCCTAGACAAGTaVGTTCAA^ 

GTGATGCATGCCTGTCATCCXIAGCTACT 
40 AGTGGCAGTGAGCCATGATTGTGCCATTGC^ 

AATAATAATAATAGTAATAGTAATAATAATAATAATAAAGAAAAC^ 

CAATACACAAAGTTAAAAGCAATTTCTACTO 

AC^GCCTGAAATGCTTCTCA(XrrTC 

CTIXrKCTTTAGGAATACTTC^ 
4 5 GTCTCCCCAGTTCCCTCrroTGTGGTC 

CTTGTTCACATTTTCCCTTATGGTTCT^ 

ATTCACACCGATAGAGAGGGTCCCATGGTT^^ 

GCTGGGTGTCAGGCATGGGCTGCACTTG^ 

OTAGTAACATCTCTCCCTGGGGTCCTCI^^ 
50 GATGGTTCATTTCAATAATCAATGAAACT 

GATAATCTCAGCTK^TGAACTTGCT^ 

CAGTTCAAGCSMX^aXATGAGGCATT^ 

GACCXXIAGGTGAAT(^CAAGC?TGGACTTCTGAC^^ 

GTAAGTTAACTGATAGGATTTTTTAAAGATGTTACT 
55 AGAAGGTATAAGAGGATAATGCTTTTATACTGCTGM 

TTTCATCAGATTAATAAAATTCACIX3CTGTTAGT^ 

ATTATCMCTM^ 
. ATAATCGCTCTCCTTTAATCKnTGATGAGATC 

GTCrcAGGTTTAAATTCAACTTGGT^ 
TGTGTTCAGCATTTTTGCACTGATGCC^
TTTTCAGATTTGGATCATGTGGCCCTCATTC 
TTGATGGGACACTTTGGAGTCTCI^ 
GCAACCCTTTCTCATTTCTGGTAGG^ 
5 GTTTTTTGAGTTGGAGTCCTGCTTTC 
TGCAACCTCCACCTCCIX3GG 
CCACXACCACTCCTGGCTAATTTTTCT 
TCTTGGACAGGCTAGTCTCAAATTCCTGACCTC^ 
ACAGTTTTGTGCCTCCAC^CACAGTGAA^ 
1 0 CATCC(X3GGATGACAGCCACTX3^ 

GCATTTGCTTGCTGTGGTCATGAAAGAGGCTTC 
TTAGGAATGCACC^GATGGAAGCTGGGTTGGACCCAGTCTATC 
AGACACGAACACATCATTTTTGCAGM 
CACTTTGGAGAAGTGGACTGAGACAGGCTGAAGA 
15 TTCCTAGATATACTTAGATTTTATCCTGTGAGT^ 
GCTTTAAGATTCCATGTTTTTTGAGA 
CAGGGTAGGCAAAAGAAGCXAGGAGGTTACTGAAATA 
TTTTGTAACAC^TCTCATGACAATC^^ 
GCTCTTCTGAAATTTGCCTCATATTCnTTG^ 
20 GCTTTOnXSGAAACACAAATGATTOT^ 

AAACATGAGCTCACTGAATACACAATTTAGTAAAGTGTAGGATGC^ 
GCTCTGTAGCATTTTATAACTACACTGGCAGTGT^ 
GGCTGACACAACCTGCCAAGTTGTTTTAGGA^ 
TGAAATCACAATGTCCTGCTCX!ATGCCC^^ 
25 CTGTTTCCACATCrATAAAGTATGAGTTAAA 

TATTAGGAGAGTTGGGAAAGAACTGTATAAATTATC 
GAAATAGACACTCCAGAATAGAGTAGAAAGAAAACAGT^ 
AATTTACAAAGAAAGGTTTAATTGAGTCACAGTTC^ 
GGC^GTTGGCAAAGTGGAAGAAGGCACCTTCT^^ 
30 TGTTGTAAAGTCraCAGAAGTGCAT^ 
AGC&CAGTCCTCACTGACAAGTTC 
ACTACX!CATTCAGAAGGCCTX3AGCT 

GGCTTATGTCAAAAGATTAStW^ 

GAGCAATTAGTGACGT7ACCTTGGATGGGTCA 
35 AGGGATCATAACAACATCAATACXAT^ 

GCATGACACGTATTCACTATCATTTCCTT^ 

CXriT^CACTGC^UVTCrCyVGT^ 

TCI^CAGTTGCATCCTATCCCACXn'TT^ 

GGAAGACCTGGTGGACCCTAAGTGATCT^ 
40 CTAGAGGGAC^CAGAGGCraAGAGGCAGGCA^ 

GTCTTCATAGGCpAATTCTGG 

TCTCACACTTCC(^TCTGTCAAGTC^ 

AAGGAAAGAAAGTU^GGATCAACIXntmrrGA 

GCATTTTTCXX!AATTTGTAATTCAGT 
45 TAGCTTCATCAGdACAAGGATTTT^ 

ATAGAGACTAACCAACATACAGTGGGATCGAA^ 

CATGGACCTGTCACnrrcCTCAACACC^ 

AGCCACCAAAACTTAAAGGCCAGGTGTGGTGGCTC 

GGAGGATCACrTGAGGCOVGGAATTTGA 
50 TTAGCTOGGCATGGTGGTATACXrrGTAGTACCAGCT ' 

CAGGAGGTGGAAGCTACAGTGAGOTATAACCACAGC^ 

GCCTCAAAACAATTTCAAAAATAAATAAATAAAAACAAAA 

AAATCAAGTTTTCCXXrTACTGA^ 

TTTTTTTTTTTTTGGTTTACATC<^ 
55 AGAGCAATGTTGCTTCT<XXI!TACCAC 

GAGAGGTACX^CTTGGACTACCAATATAAAAGGATGAAAATAm 

AGTCCCTGGAC^GCCTGTCXIACTCATGCTGAT^ 

TTGATCCAATGAAACCAGACCAAGAAAGAAGAAACGTC^^ 

AATTTTATGATTATTTTTO^TAGCTGTCCTAT 
60 GTACAAAGTTGACCAAGACGAAGAATAATGTCTGGGAGCACAA 
TGAAIXXTTTTAATACAGGAAA^

TCTTCCTTGAC^CAGTCAGTGC^ 

GAATTAGGTGGC^GAACATGCXrrrCAAT^^ 

GAGTGCTGGCTX3CATTTGAATTCCAAGCAA 
5 ACCACCTCCTGAAGTTTGTTGGGCAGGTAAAT^ 

TTTTAGAGGAAAGAAGAAAA<^TAATCGTTTCCATAAGAGT^ 

TGTTTAGGTGAGCTAATGAAGTTGTTGATAtf 

TCTGTGCACCTCTCGGTGTGGCAAt^ 

TGCATCTTTCTTTCTCX^CTGC^ 
1 0 ACCCITGAGACAGGAACACXtfUlTCT^ 

GATGCTTAGC^UVGGAATGGCAGGTGC^C^ 

TATAATATTTTATTGTGATCAGTATCAGTTGGGAAGOTACT 

GTAAACKX3GGATAGTAGGATGGCCCTACT 

CCAGAATTCCCTK3GACTTTTATCCT 
15 TGCTTTGACGCTGTCACTTCTTTT^ 

tacagcacatgc^tacckxiatccaag 

ATGTGGCTCCXXXX!ACATACCrrcGCACAA^ 

GGAATtnXMCATGATGTTCACCT^ 

TGGGAAAAGGTGTCTTTCTGGGGTGAGm^ 
20 AGAAGGAAGTGAACAAGC^CAGCTTAAACATCAT^ 

TTGTO*TGGAATCCATTTCT^ 

TAATTTTCACACCCTCATCACTGAG 

CTGTTATTACAGGCCTCATCTCrEA 

TC^AAAAGGAAGCC^TTTGCTTTGAATTC^ 
2 5 TTGATGCTGTTGTGACCTGCAGTGACC^ 

ATCTGGGCAGC^ 

GGG<X!ACTCTTTGGCAAAGAACCTGTCTAA^^ 

AAGCACAAATTGATGCTATTCCACTAAGCCAT^ 

T(XTTCTTGOCAAGAGAATCACZ^^ 

30 CTCTGGCTGTATTAATGACCTAAGAAGATGG 
O^AGCAATCTCAGCX!AAGTCAACTC^ 
ACCAGAGCCATGAGAGTGAGTAAGACC^^ 
CK3GAAACACACAATGGTGGTAAAGAGGAAAG7VAG 
GGGGAGGAAATGQTTACTTCrrGTGTGAGGGGGTTC 

35 GGGTATGAAAGGATGTGTAGGAGTCTTCTAGGGGGCACA 
GCATGGCTTGTTGGGATGAGTTTCAAOT 
AGATTTTATGCCAATGGCTCCACTTGAGTTTC 
GATTGAGTTGTGTATGATTCTACATAGAAT^^ 
TTTGK^CAACTCCCGAGGTGGAGAAGC^ 

40 AGCTCCAGOXrTGCCTCCOT 

CAGGAAAGCTCCACACACACAGCCCAGCAAACAGCAGCA 






Figure 9. 

G(X!!AATTAGAAGAAACACAACT 

TTTGGCAGAATTACAAAATATCTCAT^ 
5 ATTCATCTGTICTCTCAAGAAAA^ 

~ CTCATTACCTTATTTTTTAAGTGTTAATATAGTTTT 

CXIAAAGTGCAACTTGTAATAGCAATTCXriTC^ 

GGGCCTGAGTTGGCCAGCTTTGQTATCT(^ 

TACTCTAGGAarrACAGCCXrrTCrc 
3 AAGAGCAAAAATAAAATCAAGATGACAGAAAATTGAT 

TGCGCCTGTCX^CI^AGC^TGGATTCC 

AATATGATAGAATGAACAGCAAGCTCAGTGTCAGA 

AATGTGTGTTTTTGGTCAAGTTCTT^ . 

CTCTACATTCTCTTGCAAGCTCAAACAT^ 
5 TTTTAAAGAAATAACXTTTAAAATTATTAGTATTCGAGGA 

TTCCTTCTGCTCTATCCCGTCT^ 

AAATTGTAAACTTTGCCCACCTGGTACTT^ 

CTCTTTTTAACZATACAAAATCACrTATTTAC^ 

AGTTCTACCTAGGC^GGGATTTTTGTATGTTT^ 
0 ACATATAC^GGGTACTTGATTAATACTX^^ 

ATTTCCTAACTTAAAATTGTAAAGTCAGATCT 

TATGGAAATATATTTTAAATAGATATGTCCTC^ 
. ACXnTACCrrCATTGAACTAACTTGACCTO 

TCTrATGAATCATAAAACGATAAAAAAAACTAAAAGGGAAATGGTC 
5 CX!ATTTCGGGTAATATTCTTATTTTTATGTCA 

GTTATAAATTAAGAGAATGAGGAAATAGGTACAT^ 

AAGGAATTTCCCTCAGGAATAATCTTCAGAATAA 

ATAGCCAGATTTCATTTTTAATTGAAACTT^ 

ATAAACACAAGTAAACATTCGGAATATAGACATC^ 
0 TTTTACAAGTAAGGAGTTTTAGTTGCAGCAAAAGAAATAA^ 

CCXJAATCTGTTTGGTGCTAAAGCCAATGTTC^ 

TTTATCAGTAAAAGGCAAAATTTGCTATTATTGA 

AATCCAACTVAAAAAGTTCC^ACTATTO 

TGACTTTGAAAAGGAAGTTGGTCACTT^ 
5 AAACAGTTGCCTTATATGCTAAAATTAT^^ 

AGGAACTTTACACGGCTCATOTI^ 

AGTTTTAGCCAGCCTGGGGAAC&XAGTG^ 

CACACXnrrTGGTCXXaGCri^CT^ 

AGTGAGCTGTGATTGCGCCACTGTGCTTCAA^^ 
0 ATTAAGTAAAGAAAAAAAAAAGAATCAAAGAAAAAAATAATTCCCC^ 

GGATAAGCTATAAAGTGTCAAATAATGCTGTTJ^ 

CACATAGTATGTGCTGTATTTTATAT^ 

ATTCTTCAGGAGAATGACACTGAAGTCCTTAATT^ 

GTATATGGTGCTTAGTTATGAAAAATACCnXX!AGGGCT^ 
5 GAAAACAGTAATTAGAAAAAAACCATGTATCCAACra 

TTATTTCAGCCrcCCATGTATTTTCTTAT^ 

CCTCTCTCACTTTTCrrTAGA^ 

TACTGGGAATTCTCAATGTTTCTTGTT^ 

ATAAGGAGCAAACTCCACTGAGAGATAGATACTAATAA^ 
0 TCCAAACTTTTATAAAAAATATTATAAAATAATAC^ 

ATTAAAGTAAACrrCAGACCTAACTTTCATCACTAAAAGAT 

CTAATATTACA(XIAATTAATTTGCTTGATATAGTGAATAT^ 

CCTTTGCATATTAGCATAGGTATTTTCTO 

TTTAAAGAGGATGTATAATAATTAATTCATCCATA 
5 TCACTGTTATAAAATAAAGCAGCAATGAAT^ 

TTAAGATAGATTTCCTAAAATTGAATTACTX3AGTCAAAC^ 

TTCATTTGAATAATTTTCAACrrCCTAC^ 

AAATATCTTTGCTOTTGTAATTTGCATT^ 

TACCAGTATACTACATACTTTTTTATATGAATTGCCTATTTGCA 
GAAFITTI^JVLTAT

AAATAGCTATATGGTTTGTTGTTGGTTTTAAAT^ 
TATAGATTTCTTTAAAAGTAATAAAATTATTGCCATATTAra 
TTCCAGTCATCACTTTTACTCTCATGT^ 
5 AGGATATTTGGGAAATAATACAATTTATACTAAACACACA^ 
ACCCTAGGAATATGCTGTTTTTGTCTTT^ 
TTCTTCAAATGACACAGAAATCAAGTCTTTTG 
TCATGAAGCTTGGTUITTCCAAGAGTTGCCACTTCATT^ 
CTCCTCTTAATTTTGTCATTCATTT^ 
1 0 CAGTGACTTACTCAGAGAGATGATTCTCTAATGGA 
GCXSTGGTGGCTCATGCCTATAATCCCTUK^ 
TTCAAGATCAGCCTAGTCAATGTGGTGAAACX^CCZA 

GGTGCACTCCCATAATCCCAGCTACTC^^ . 

TTGCAGTGCGCCTGGATTATTAACACTACACTC 
1 5 AGAAGTTGATAAGGGAGMAGTTCATGGCAAO^ 

GCCTACTAGTGTAATTTCCATAAATCTGCCTC^ 

TTTATTGAAACTTTATTTTGTATAAGTCTC^ 

CATTAATAATAAATACAAAATGAGAGATGCCX3ATACTC 

CTGGCTGTAGTAGATGGGGAAGTGTTCAAAGAGGACT 
20 CATGGCTAACTCACTAAGCGACTTAA 

ACTC^TGTCTGTTGCTGTATTGGCTAAATC 

TCGGAATCTGGATTTTTTTTTCA^ 

GCACAGTCATTTAATATTCCCTA^ 

TAAAAATCATTAC^TGTGAAAGGGTGGAGAGTGTGT^ 
25 CAATATTGTOTTATTATTGTTAACACAAGTG^ 

TTCnrrTGTCTGCCTCAGGTCAATTAAC^^ 

TCAATCTGGACTCAGATGGGCTA^^ 

GAGAAAGATAC^TAAGATTTACAATAAAAGAATTAT^ 

GAAGGAAAAATAGGTGAACAGCTTTTTAAT^ 
30 AGCTGGGATATGAGAATAATAGGTCAAACGTATATCGATAC^ 

GAATGCTAAGGAGGGC^ 

AGGGTGTTTCAGGC»CTGTAC3^ 

ACTATCTACTCTTTACXAAAAAGCT 

ATAATCAGCAGAGATGGTCCTTATCTAAGTTTTT^ 
35 ACTAAAGAAGAAAACTTTTGTAAATTAGTTTAGTTT^ 

CACATATCATCTGAGAAGCCCAGTGAGTTTGC^^ 

GATGATTAGAGAGAAGTGATGTGTTCTTO^ 

GTGGC^GGGAGCCATTGCAGGTTTTCCAAGAT^ 

GCAGTGAAGTGGTGTGTCCTATGATAATGTACT 
40 AGGGACTACTK3GTTAGCACXriTACCT^ 

TAATATGGGAGTAAC2UVAAATAGTAACTACTTCATA 

GAAGTGCrTGGC^GTTCCrGATAAAT^ 

TTATTACTCCCATAGATACATATAGAA<^GACrre 

GACAAGTTGGTGAAATGGCTACATGAGAGCGGAGGAC^ 
45 GA7UITATGACCGCTTGAATGAAGGTATATGAGTGTCAACT CAAGGGGACCAGGTAGGTTTCATCAGAAA 

TTAAGGAAGCTTAAGGAGAACAGCCAAGTTCAGCTT^ 

GTCTTTCAAGAAACCAAGGACTGTTGAAAGTAGCAAGAGCT 

TCACACTTTC^TGTCAGTGGAGCAAAGAAAT^ 

ATCTGTGTTATTCTGATCTTTAAC^ 
5 0 GAAATAGTATAATCAACATAACAGTATAGCIACTG^ 

CATTCTVAAAGGATACATAAAATACAACAAGATTACATAAATGA^ 

AAAAGTGAAAACAGAGATCATAGGCACAAATAAGA 

TTTACCCC^GATGGACX^CAGGGTTGTTGT^^ 

CAATTAGAAC^TGTGGAAAAAATAGTGGCCTGTTAGAAGCCTAATTAA 
5 5 AAAAAAAAAAAGACK3C(X3AGCTGTA^ 

TCACX3AGGTCAGGA<^TCAAGA^ 

ATTAGCCGGGCGTGGTGGCGGGAGCCTOTAGTCO^ 

ACCCGGGAGGCGKJAGCTTGCAGTGAGCOT 

TCC GTCTCAAAAAAGAAAAAAGAAAGAAAAACAA 
60 TATATC^AGATAAGGAAAATTTGTAGTCAAGAATAGAA 
AGAAAGTGGAAACATCCTCAGACTAT6CAGTAAAAAA

AACTIX^TAGGTAATATGTGAAAGTCATAGAATGT^ 

CATAAGAAGTAAGAAAAGCTCTGATGTC^ 

ACTTTTCXTTCAGTTTGAAGTAAATAGTGGACA 
5 ATTACAACTAATGTAATATGCTAAAACATGCT 

AAAATXXX!CTG!GAAAAACACGGGCATTGAT^ 

C^TTTGTACCTTGAAATACCTGTCTOT 

OVAATAAAGGATGAACAGATGTAACTCAG7UUICTC 

TCTCCTAGCCTTTTCAAAGGTGTTAG 
1 0 TTAACTGTATTAAATGCGAATCCCGAGAAAA 

CTTATGTGAACTTTGAAAGATOTGTCTACATAAGTTC 

TCTACTTGCCCTTTCrrAGAGAGGTC 

GTTTCTCGAGGAATCAGCATTCAGTCi^ 

GCAGGAACAGCGCCGGGGCXSTGGGCTGAG^ 
1 5 TTCX^GTAGCGGCTCTTCCAAGCT^ 

AGTCGGAGTATCTTCTTCCAAAATTTCACGT^ 

AAGCTGGGAGCmCTATGGGACAGTTCCCAAGTC 

AGAAGGGCTTCTTGAGGCGTGGATAGTGTGAAGTCCT . 

CTAGACrTCAGGAGCTCCrrGGA 
AGCXJCXSCGCGGTGCGGGGACCTGCnt^K^

ACTGGAACCGGGAGGGAGAATCGCACTGGCGGC^^ 

G(XrTTCGTGGAGATGCTGGAGACCCCGCGCA(^ 

GGGGCATCAA03GCGGGCX3CTCCCTCm 

TCC^GCGCX5CCTGAGGCTCATCCATTT^ 
AGGGG&COGCAATGGAGGAGCAAAGAAGAAG

TCATTTTCATAGTTTACATAGTTC 

C^TGTTGTTGTTTTTAATTATTTTTAAGAAAC^ 
<110> THE UNIVERSITY OF SYDNEY



<120> THE UNIVERSITY OF SYDNEY 



<130> P41173 
<160> 5 



<170> PatentIn version 3.1

<210> 1 

<211> 12983

<212> DNA 

<213> Homo sapiens 

<400> 1 

ctggttcatc tcattgggac tggttggaca agagggtgca gcccacggag ggtgagccaa 
60 

agcagggtgg ggcgtcgcct cacctgggaa gcacaagggg tcgtggaatt ttctccccta 
120 



cccaaggaaa gccataaggg actgagcctg aggaactgtg cactctggcc cagatactgc 
180 



acttttccca tggtctttgc aacccgcaga ccaggagatt ccctccggtg cctatgccac 
240 



cagggccctg ggtttcaagc acaaaactgg gcagccattt gggcagacac cgaactagct 
300 



gcaggagttt tttttttttt tttccatacc ccattggcac ctggaacgcc agtgagacag 
360 



aaccgttcac tcccctggaa agggggctga aaccagggat ccaagtggtc tggctcggtg 
420 



ggccccaccc ccatggagcc cagcaaacaa agattcactt ggcttgaaat tcttgctgcc 
480 

agcacagcag cagtctgaga ttgacctggg accctcgaac ttggttgggt gctgtggggg 
540 

ggcatcttcc attgctgagg cttgagtagg tggttttacc ttcgcggtgt aaacaaagct 
600 

gctgggaagt ttgaactggg tggagctcac cacagctcag taaggccact gtggccagac 
660 



tgcctctctg gatttctcct ctctgggaag gatatctctg aaaaaaaggc agcagcccca
720

gtcagggact tatagatgaa acccccatct ccctgggaca gagcccctcg gggaagaggt
780 

ggcttccacc attgtggaag actgtgtggc aattcctcac ggatttagaa ctagagatac 
840 

catttgaccc agcaatccca ttactgggtg tatacccata ggattataaa tcattctact 
900 

ataaagacac atgcacactt atgtttattg taacactatt tacaatagca atgacctgga 
960 

accaatccaa aagcccatca atgatagact gaataaagaa aatgtggcac atatacactg 
1020 

tggaatacta tgcagccata aaaaaggatg agttcatgtc ctttgcagag acatggatga 
1080 

agctggaaac catcattctc agcaaactag cacaataaca gaaaaccaaa cactgcatgt 
1140 

tgtcactcat aagtgggagt taaacaatga gaacacatgg acacagggag gggaacgtca 
1200 

cacactgggg catgtcgggg agtgggggcc tacgggaggg atagcattag cagaaatacc 
1260 

taatgtaggt gacgggttga tgggtgcagc aaaccaccat ggcacatata cacctatgta 
1320 

ataaaactgc acgttctgca catgtacccc agaacttaaa gtataattaa taataataat 
1380 

aatttctggg catgtaagta gctgtctttc aggttctact ttgatacata ttctgagaga 
1440 

attaaacctg tcaaagaaac cttgactttc aatggcaggc actggaattg accctaataa 
1500 

tgtgttttgg ggtaagccta ctcatattct caacctgtct gcagtagtcg ttagaatctg 
1560 

aacttcctga agttcatgtg caaagttgag ttaattgttt aatattcaac aaggattatg 
1620 

ccagtaagat ggtaggaaaa tattagatat gtgtcatcac tgctggtatt atttaaactg 
1680 

caacatattt tagctggctg ctgatctcag ccaccatgcc tgcattttat ctctgtctcg 
1740

tggtctgcaa ccttggaagc tttgaactta gctcatagaa tcctgggcat caagaacatg
1800 

tggttctaat ggctagatag ggaatgagag taaaaggatt ttgcccacgg tcacgtgagt 
1860 

aaacaacaga tttggagggg tctggactac tgtgatgact tcattctgac aatatgttcc 
1920 

agttgtcctt tcatttcctc ctaatcacat gtctggtctg atctggctgt ttcccacctt 
1980 

ccaattcctg ccttctccaa tgctcccttc cgtaggtcac tctgtggctc agagaccctg 
2040 

cttagcaagc gcccaacctt tcaattattt gttcagtaaa acttgaactc atgtctcccc 
2100 

ttcttgataa aaagaaaata cgttatgtaa tgtcgggtta ctctataact cttgtcctgt 
2160 

ctctcggcaa ctactgaact aactgttttc atattgagca aacgtttatg gaaggactgc 
2220 

caagagtcag gtactaggct tggtaatatt ccccgttctc tctagtcaaa gccaacacca 
2280 

gccagacttg cagatctagg tcccaagccc actgcagatc acaggccagg gtctggtctc 
2340 

ctctgagctc ctttgggagg gaaagacaga attattaaca cccattttgt agattaggca 
2400 

actgaggctg aggaagttta aataactcag acagggcctg cacgtcagtc atattccaag 
2460 

gatccctact cactgtcttc tctctacaga acgagatgtc tctggagtcc atagaaagcc 
2520 

caggagcctg gctgggcacg gtggctcctg cctgtaatcc cagcactttg ggaggccgag 
2580 

gcaggcagat cacctgagct caggagttca agaccagcct gggcaacatg gcaaaacccc 
2640 

atctctacta aaaatacaaa aaattagctg ggcgtggtgg tgcatgcctc taatcccagc 
2700 

tacttgggag gctgaggcac aagaattgct tgagcccagg aggcagcagt tgcagtgagc 
2760 
Untitled.ST25.txt

tgagattgtg ccagtgcact ccagcctggg caacagagca agattccatt tcaaaaacaa
2820

aaacaaacac aaacaaacaa acaaaaatag aaagcccagg gaccacctgc gtcaggttcc
2880

cagccacacc tttttcttgt cctcctctgt ctctggcatc ttctcacagg ttcctaattg
2940

tttgtggttg cacaaattca aaatcccaga aaaattacca cttcacaccc actcagatgg
3000

ctattttttt tttgaaggaa gataacaagt gttgacaaga acatggagaa attggaattc
3060

tcacccattg ctggtgagaa tgtaatacgg tgctgctgct atggaaaaca gcttggagtt
3120

tcctcaaaaa gttcaacaga atttcaatgt gacccagcaa ttcccctcta agttatagat
3180

ctgagaggat taaaaacagt tactaaaata cacggactca catatttcta acagtccaat
3240

tcacaagggc caaaaggtgc taatagccca catgtccatc gatggatgga taaataaatt
3300

gtggtctatc catacaatgg aatattattc ggccataaat ggaatgaagt actgacgcat
3360

gctacagaat ggatgaaccg caaaaaaaat ggatgaacac atgctacaga atggatagcc
3420

tcactttact atgaagtgaa ggccagaaac gaagtccata tattgcatca tacaaaatat
3480

ccagaagagg gaagcccaca gagacagaat gtgcaatggt ggatgccagg gtctggggag
3540

aggggagagt ggggagaaac tgctcaactg gtacaggctt tattttggaa tgatgggaac
3600

attttgcaac tagatagagg tagtgattgc agaacacaga atgtactgaa ttccactgat
3660

ttttttcacc ttaaaatggt taattttcag tcctgagatt ggataatcat aaaaaaatgg
3720

ttaattttat gttatgtgaa tttcatccct atacatattt taaacctcag aaatatacac
3780

tagcaggcat ggaacaggtc actgtggtgc ctgccaagcc cggtgatgtt atctggggtc
3840

cccggccagc cttaagcctc ttgctgaccg gtggagggca gaacctttgc cctaaaagta 
3900 

taatatccac atgctggcat gattcctggc cagatggctt ctttattagc agtaattgaa 
3960 

actgcctcga tacagacact gtaccttgca accaaaaaat gactcaacaa tgataataag 
4020 

ggttaagctg ggcctttctc tctttgccag ttaaattata tttattatag cttgacatga 
4080 

aaaacaaagc aactccaaca ggtatcacaa gggcaaagga catgaacatt ttatcaaaga 
4140 

agaaatgcag ctgtcaaaaa tacagaaata ttcaaccttg ttcataataa agtggctggg 
4200 

ctcagtggtt catgcctgta atcccagtgc tttgcaaggc tgagacagga ggatcatttg 
4260 

aagccagaag ttcaagacca tcctaggcaa gtcagttcaa taccagactt catgtctaca
4320 

aaacatcaaa aaattagcca ggcatggtga tgcatgcctg ttgtcccagc tactcaggag 
4380 

gctgaggcag gagaattgct tgagcctggg aggctgcggt ggcggtgagc catgattgtg 
4440 

ccattgtact ccagcctggg caatgcagca agactgtcta aataacaaaa ataatagtaa 
4500 



agaaaaggat tgggatgcca tttacttgcg tattcaatac acagagttaa aagtaatttc 
4560 

tacgttttct atttttttat tactaaaaaa agctggacca ttctcacagc ctgaaatgct 
4620 

tctcactttc ccttcttctg tccaaacact tctctatgat aatgcaaaca gtcactcctt 
4680 

taggaagact tcaccccagg tagttccaga tccccttatc tctgccttcc cagaactcct 
4740 

ggtgtctctc cagttccctc cgtgtggtga agtaccctac ctagggtttc agtatggctc 
4800 



tgtctgcaaa ggtcttgttc acaccttccc ttatggttct gttgccctgt gttgtgtcat 
4860 
agcacagggc acagtggaga acccattcac actgatagag agggccccat ggtcctggag
4920 

ataaccatgt aaccgatcag aataaggcat tgagggctgg gtgtcaggcg tgggctgcac 
4980 

ttgggtgggc aggtcccctg gaaagtcact gggtttggca agcttcctag taacatgtct 
5040 

ctctggggtc ccccttggaa cttcatgcaa aaatgctggt tgctggttta ttctagagag 
5100 



atggttcatt cctttcattt gattatcaaa gaaactcatg tcccaattaa aggtcataaa 
5160 



gcccagtttg taaactgaga tgatctcagc tgaatgaact tgctgaccct ctgctttcct 
5220 



ccagcctctc ggtgcccttg aaatcatgtc ggttcaagca gcctcatgag gcattacaaa 
5280 

gtttaattat ttcagtgatt attaaacctt gtcctgtgtt gaccccaggt gaatcacaag 
5340 

ctgaacttct gacaagaaca agctatcata ttcttttcaa ttacagaaaa aagtaagtta 
5400 

attgatagga ttttttttgt ttaaaaaaaa tgttactagt tttgaaaagg taatatgtgc 
5460 



acatggtaaa cactaagaag gtataagagc ataatgcttt tatactacta agaataatgt 
5520 



tttctctaag ttttttttgg tagatgcttt catcagatta agaaaattcc ctgctattag 
5580 



ttgttgaagg tttttatatc ataaatgaaa gttgaatatt attatcatat attattaata 
5640 



tattgttatt gaactatcaa agccttttcc taaaaccatt gagatgatct tataaccatt 
5700 



ctcctttaac ctgttgacga gatcattggt atttatacta tttctctgtt aaccattctt 
5760 



gagtctcagg tttaaattca acttggtcat ggtgtgtcat ctttgatcat tgctgtctgt 
5820 



ggcttgctac tgttttgttt aggatttttg cactgatgct catcaatgag actggcatgc 
5880 
catcttcctt tgcagtcctg atttttttct gatttggatc atgtggttat ggccctcatg
5940

gaatgagttg ggcatgatgc ctttttttca tgtctctgga ttgatgggac actttggatt
6000

ctctccagat ggccctcaat ggtccctgcc tcctcattgt taggcccctg ggcaagccct
6060

tctcatttct ggtaggccca ggaacctgtg ggggttttgt ttgtttgttt gtttcttgag
6120

tcggagtctc actctgtcac ccaggctgga gttggagtgc aatggcccga tcttggctca
6180

ctgcaacctc cacctcccag attcaagcaa ttctcctgcc tcagcctcct gagtagctgg
6240

aattacaggc acccaccgac acaccctgct aatttttgta tttttagtac agatggggtt
6300

tcacaatatt ggccaagctg gtctcgaact cctgatctca tgatctgccc ggcttggcct
6360

cccaaagtgt tgagattaca agcatgagcc accacaccca gtgaacctgt ggtttttaga
6420

agctccccat gcatgtgaat gctgtgagca tcccaggatg acagccactg tgtgttcagc
6480

tgttggaact gtgagaaagc accagtggga ccttctccag cacctgcctg ctgagttcat
6540

ggaagaggct tgttggggag atgatgccct ggctgactcc tgaaggatgg ttaggaatgc
6600

accagatgga agctgggttg gacccactct atgctgaaga acagcttgtg tggacacaag
6660

gagacacgga tatgtcattt ttgtagagcc tgaggagtgt ccaatcacac catttgctta
6720

aaacatcatg cacacttgga aaagtggact gagaccgaat gaagaagcta acagtggcca
6780

gatcagaaag ggtcttgtgt tacttcctag agatacttag attttatcct gtgggtgata
6840

ggagcagttg gagggactga agacaaggaa agaaacatgt ttcaagatct atgtttttca
6900

agacgctttt ctggtggctg agtagggaat tccctggata agtcctgccc agggtcaggc
6960

aaaacaagtt agggggttac tgaaataagg agtatgagaa atggtgtagg ttgtgctgac
7020

gttttgtaac acatctcatg atgatcttca tttccttcac taatttcctg tttcattaat
7080

tcccttccac gtgctcttct gaaatttgcc tcacattctc tgatttctct tttacctgtt
7140

ggtttcatca ccttttactt tttgctttcc tggaaacaca aatgattctg attgtgacat
7200

gtcagaatta tttgcaacat ttgcctttct gctgaaacca tgagttcact gaatacacaa
7260

tttagtaaag tgtaggatgc acatgtcgtt ttcgtggtca caaccagctc tgtagcattt
7320

tataactaca ctggcagtgt gctgggaggt gtagagagaa atatttatca catgtgtggc
7380

tgacacaacc tgccaagtta ttttaggagc ctccttggaa tcccagcaag aatgctaccg
7440

gcacaatttg taatcacagc atcctgctcc atgccttggc ttcatggcat agtcacttct
7500

gcaagtctct ttccagctgt ctgttcccat gtctataaag tatgagttaa atcatcctaa
7560

cactactcat cttacaaagt tttcttgctg atgttaagag agttgggaaa gaactgtata
7620

aactgtgaag tgccatggag atgttagtgg ttactttatc aagaaataga cactctagaa
7680

tggagtagaa agccaacagt tatgattgag tcctcctcct cttcttcttt ttattaattt
7740

ataaagaaaa gaggtttaat tgactcacag ttccatatgg ctggggaggc ctcgggaaac
7800

tctcagtcat agcaggaggc aaaggggaag aaggcacctt cttcacaagg cggcaggaga
7860

gagagagctc ctgttctttt ttgtcataaa gtctacagaa gtgcttatac ttcaggacaa
7920

gggcaggcag agagaaggaa ggacattgct tcaccccagc cctcactgac gagtttgcta
7980

ggggacctca ctttgtccca gagtagggca gaactctggc cactacccat tcagaaggcc
8040 

tgggctgcac tgctagttcc tcactaactc tgtgtggcct tgggcaaggt tgggcctgtg 
8100 

ttaacagatt atgaccctgg gctctcaagc tagaggatct aaatttgaat cctggctctg 
8160 

ctaaagcaat tagtgatgta aactttaatg ggtcagttaa ccttcctgtg gcttagtttg 
8220 

ctcatctgta aaatagggat cataacagta tcaataccac atgattgttg gacagattga 
8280 

atcagttaat gcaggggaag tacttagcat gacacgtatt cactatcatt tcctggagta 
8340 

agagctgtgt gtgagtgggt gtgagcatgt gtgaaacctt ttctctgcaa tctcagttaa 
8400 

gaaaccaatc cagaatttaa agttcagggc ctaaatgggt ggttatcttc tcccagttcc 
8460 

atcctatccc acctttgctc ttcctcccgc ccacaggagc tgttggtcct tgattgggct 
8520 

ggaagacctg gtggacccta agtgatctat aagaggagaa tagagaacag ggaatgtctt 
8580 

caaaaatcta gagggacaca gaggctgaga ggcaggcagt cctgcagggt cttctgattg 
8640 

ggacaaggag aaccttggtc ttcacaggcc aattctggtc agtttccccc atggacagat 
8700 



gaggaaacag gcccaggaat atccaaggtc tcacacttcc catctgtcaa gtcttgttga 
8760 

ttctgttgta ttcatgtctc tcaaagggag atagagttta gggaagaaag aaggatcaac 
8820 

tgtgtctgat accactggga gcttaagtaa agggttcttt tacttcatag catttatccc 
8880 

aatttgtaat tcagtattat ttgtgtggct gtttggtgtc tctttctcct atatgagtgc 
8940 

tagcttcata agggcaagga ttttgattct ttaatattta gtgcttgcca catgccctga 
9000 
acacagcagg catacaggct aaccaacata cagtggcatg aaagtcatga aagtgagaca
9060

cctacctcct ccagtgccaa gagagcataa ccatgcacct gtcactctcc tcaacaccac
9120

ccccaagcat gaggcccaaa agcattagct aatcccctcc tccagccact aaaacttaaa
9180

ggccaggtgt ggtggctccc atctgaaatc ccagaacttc aggagacagc agcaggagga
9240

tcacttgagg ccaggagttt gagatcagcc tgggcaacat agctaggtcc catctgtact
9300

aaaaattagc tgggcgttgt tgcatgcctg tagtcccagc tactaaggag gctgaggtgg
9360

gaggatcact tgagcccagg aggtggaaac aacagtaagc tataatcaca gcactgaact
9420

ctagcctggg caacagagtg acaccctgcc tcaaaacaat tttaaaaata aataagagca
9480

aaacttagat accacgtggt caccccaaca tgcaaaatca agttttcccc tactgagaag
9540

aatggggact tgacagctga gttacagaga gataatcttc ttcttctttt tttttttttg
9600

gtttacatcc tcaagatcat gacttgtgaa atttgaatcg aatacacatg taattccaga
9660

gcaatgttgc ctccgcatac catcagcaat tcacttggct actggaagtc aggataagct
9720

tcccagaaga gaggtaccac ttgggctacc aatataaaag gatgaaaata tcagagtgat
9780

ggtgttcttt acaacgttga gtccctggac agcctgtcca ctgatgctga tatctgagcc
9840

taatgcttct ctgaatgttg agattgaact ttgatccaat gaaactagaa cgagaaagaa
9900

gataagtctt tcattgttga taaggacatt atgtttctca tacttgtatg attatttttc
9960

cttagctgta ctataattat ctgcttattt gtctctgctc tatgtgctta gggtacaaag
10020

ttgaccaaga ccaactttgg ttggaagcat agtactaaga gcacagtact gagagcacag
10080

tattgagagc acagctttaa aaaacatgat gaaggcttta atacaggaaa tgagcagggg
10140

agaggcatgt ggtggttgga tgtatcttcc ttgacacagt cagtgcagct ctcagtagtc
10200

aagtccctac atgttagaag atgttacctt ctgtggaatt aagtggcaga acttgccttc
10260

aattattttc ctttgcagaa caacaccaac tgcattagtt aggacacagt gctggctgca
10320

tttaagtccc aagcgatgat tagtctctca ctgttggtat agattcaaac caatcagacc
10380

acctcctaaa gtttgtaggg caggtaaatc ctcatcttag aataaaaatc atcttaccaa
10440

gtatgtgttt tagaggcaag aagaaaacat atttgtttct gtaagagttt tgtttaaaaa
10500

aaatataaga aaggctctcg gtttaggtga ggtaatgaag ttgttgatag ttatcagatg
10560

acactggaat ctttacttct ctgaacgtgt tctgtgcatc tctcagtgtg ggaacataga
10620

gagggagatc ctccagcaat gccactgata tggtcagaaa ctgcatcttt ctttctccct
10680

gctgagatga gatggagtcc tttgttctag aagacccatg gtggtgccgc tgggagtaac
10740

ccttgagaca ggaacacaaa tcccaaccaa tttgtggttg cagccttgag tctcactatt
10800

tcccatagtg atgcgtagca gggaatggca ggtgcaccag agcaggagag gacctaatat
10860

ctcccttcct gttagctttt tataaagttt tattgtgatc agtagcagtt gggaagctac
10920

ttgcagtcac tgagcctcag tttctacatc tgtaaactgg ggatagtagc atggccccta
10980

cttaatgtgc tcagcaaagc cactgaaagg agacagaaat gtatctaaat taccctggac
11040

ttttatccta cctctcttgg ggattgtcac caccttccca tgtttgtcct ttttggtttg
11100

atgcttgctg tcacttcttt ccttaggtgc ctctctgtac ggctctttta tcccagggat
11160

tccagagtta cagcacatgc ataccaccat ccaagcatgt ttatttgtct cctgcttcac
11220

taggctgtcc ccaaggaaca tgtggctccc ggcacacacc tggcacaaca ctgcacatga
11280

cattcaccca cttggccttg aatctgacaa ggaatctggc atgatgttca cccactcagg
11340

ccaggtgccg agcagccctg gaggcttagg ggccagaggg atgggaaaag gtgtctttct
11400

ggggtgagta tcagtttctg caggagggct gaatgtgaga aagaataaag agagaaggaa
11460

gcgaacaagc acagcttaaa catcgcctat ttctattgag ttttaagaac gctgtgattt
11520

tgtttgtcat gcaatccatt catcaggcca ggcagacaca gaacttgggt gtgagtgacg
11580

ataatgagct gatataattt tcacaccctc atcactgaga tctctcccat caggaatggg
11640

tcagggagct cacaggtggc agcaactgct attacaggcc tcatctctac cagctcctgg
11700

ggcctgccct cctcccatta gaaaatcctc cacttgtcaa aaaggaagcc atttgctttg
11760

aactccaatt ccacccccaa gaggctggga ccatcttact ggagtccttg atgctgtgtg
11820

acctgcagtg accactgccc catcattgct ggctgaggtg gttggggtcc atctggctat
11880

ctgggcagct gttctcttct ctcctttctc tcctgtttcc agacatgcag tatttccaga
11940

gagaaggggc cactctttgg caaagaacct gtctaacttg ctatctatgg caggaccttt
12000

gaagggttca caggaagcag cacaaattga tactattcca ccaagccatc agctccatct
12060

catccatgcc ctgtctctcc tttaggggtc cccttgccaa cagaatcaca gaggaccagc
12120

ctgaaagtgc agagacagca gctgaggcac agccaagagc tctggctgta ttaatgacct
12180 

aagaagtcac cagaaagtca gaagggatga catgcagagg cccagcaatc tcagctaagt 
12240 

caactccacc agcctttcta gttgcccact gtgtgtacag caccctggta gggaccagag 
12300 

ccatgacagg gaataagact agactatgcc cttgaggagc tcacctctgt tcagggaaac 
12360 

aggcgtggaa acacaatggt ggtaaagagg aaagaggaca ataggattgc atgaagggga 
12420 

tggaaggtgc ccaggggagg aaatggttac atctgtgtga ggagtttggt gaggaaagac 
12480 

tctaagagaa ggctctgtct gtctgggttt ggaaggatgt gtaggagtct tctagggggc 
12540 

acaggcacac tccaggcata ggtaaagatc tgtaggtgtg gcttgttggg atgaatttca 
12600 

agtattttgg aatgaggaca gccatagaga caagggcaag agagaggcga tttaatagat 
12660 

tttatgccaa tggctccact tgagtttctg ataagaaccc agaacccttg gactccccag 
12720 

taacattgat tgagttgttt atgatacctc atagaatatg aactcaaagg aggtcagtga 
12780 

gtggtgtgtg tgtgattctt tgccaacttc caaggtggag aagcctcttc caactgcagg 
12840 

cagagcacag gtggccctgc tactggctgc agctccagcc ctgcctcctt ctctagcata 
12900 

taaacaatcc aacagcctca ctgaatcact gctgtgcagg gcaggaaagc tccatgcaca 
12960 

tagcccagca aagagcaaca cag 
12983 



<210> 2 

<211> 9715 

<212> DNA 

<213> Homo sapiens 

<400> 2 
ctggttcatc tcattgggac tggttggaca agagggtgca gcccacggag ggtgagccaa
60 



agcagggtgg ggcgtcgcct cacctgggaa gcacaagggg tcgtggaatt ttctccccta 
120 



cccaaggaaa gccataaggg actgagcctg aggaactgtg cactctggcc cagatactgc 
180 



acttttccca tggtctttgc aacccgcaga ccaggagatt ccctccggtg cctatgccac 
240 



cagggccctg ggtttcaagc acaaaactgg gcagccattt gggcagacac cgaactagct 
300 



gcaggagttt tttttttttt tttccatacc ccattggcac ctggaacgcc agtgagacag 
360 



aaccgttcac tcccctggaa agggggctga aaccagggat ccaagtggtc tggctcggtg 
420 



ggccccaccc ccatggagcc cagcaaacaa agattcactt ggcttgaaat tcttgctgcc 
480 



agcacagcag cagtctgaga ttgacctggg accctcgaac ttggttgggt gctgtggggg 
540 



ggcatcttcc attgctgagg cttgagtagg tggttttacc ttcgcggtgt aaacaaagct 
600 



gctgggaagt ttgaactggg tggagctcac cacagctcag taaggccact gtggccagac 
660 



tgcctctctg gatttctcct ctctgggaag gatatctctg aaaaaaaggc agcagcccca 
720 



gtcagggact tatagatgaa acccccatct ccctgggaca gagcccctcg gggaagaggt 
780 



ggcttccacc attgtggaag actgtgtggc aattcctcac ggatttagaa ctagagatac 
840 



catttgaccc agcaatccca ttactgggtg tatacccata ggattataaa tcattctact 
900 



ataaagacac atgcacactt atgtttattg taacactatt tacaatagca atgacctgga 
960 



accaatccaa aagcccatca atgatagact gaataaagaa aatgtggcac atatacactg 
1020 



tggaatacta tgcagccata aaaaaggatg agttcatgtc ctttgcagag acatggatga 
1080



agctggaaac catcattctc agcaaactag cacaataaca gaaaaccaaa cactgcatgt 
1140 



tgtcactcat aagtgggagt taaacaatga gaacacatgg acacagggag gggaacgtca 
1200 



cacactgggg catgtcgggg agtgggggcc tacgggaggg atagcattag cagaaatacc 
1260 



taatgtaggt gacgggttga tgggtgcagc aaaccaccat ggcacatata cacctatgta 
1320 



ataaaactgc acgttctgca catgtacccc agaacttaaa gtataattaa taataataat 
1380 



aatttctggg catgtaagta gctgtctttc aggttctact ttgatacata ttctgagaga 
1440 



attaaacctg tcaaagaaac cttgactttc aatggcaggc actggaattg accctaataa 
1500 



tgtgttttgg ggtaagccta ctcatattct caacctgtct gcagtagtcg ttagaatctg 
1560 



aacttcctga agttcatgtg caaagttgag ttaattgttt aatattcaac aaggattatg 
1620 



ccagtaagat ggtaggaaaa tattagatat gtgtcatcac tgctggtatt atttaaactg 
1680 



caacatattt tagctggctg ctgatctcag ccaccatgcc tgcattttat ctctgtctcg 
1740 



tggtctgcaa ccttggaagc tttgaactta gctcatagaa tcctgggcat caagaacatg 
1800 



tggttctaat ggctagatag ggaatgagag taaaaggatt ttgcccacgg tcacgtgagt 
1860 



aaacaacaga tttggagggg tctggactac tgtgatgact tcattctgac aatatgttcc 
1920 



agttgtcctt tcatttcctc ctaatcacat gtctggtctg atctggctgt ttcccacctt 
1980 



ccaattcctg ccttctccaa tgctcccttc cgtaggtcac tctgtggctc agagaccctg 
2040 



cttagcaagc gcccaacctt tcaattattt gttcagtaaa acttgaactc atgtctcccc 
2100 
ttcttgataa aaagaaaata cgttatgtaa tgtcgggtta ctctataact cttgtcctgt
2160 

ctctcggcaa ctactgaact aactgttttc atattgagca aacgtttatg gaaggactgc 
2220 

caagagtcag gtactaggct tggtaatatt ccccgttctc tctagtcaaa gccaacacca 
2280 



gccagacttg cagatctagg tcccaagccc actgcagatc acaggccagg gtctggtctc 
2340 



ctctgagctc ctttgggagg gaaagacaga attattaaca cccattttgt agattaggca 
2400 



actgaggctg aggaagttta aataactcag acagggcctg cacgtcagtc atattccaag 
2460 



gatccctact cactgtcttc tctctacaga acgagatgtc tctggagtcc atagaaagcc 
2520 



caggagcctg gctgggcacg gtggctcctg cctgtaatcc cagcactttg ggaggccgag 
2580 



gcaggcagat cacctgagct caggagttca agaccagcct gggcaacatg gcaaaacccc 
2640 



atctctacta aaaatacaaa aaattagctg ggcgtggtgg tgcatgcctc taatcccagc 
2700 



tacttgggag gctgaggcac aagaattgct tgagcccagg aggcagcagt tgcagtgagc 
2760 



tgagattgtg ccagtgcact ccagcctggg caacagagca agattccatt tcaaaaacaa 
2820 



aaacaaacac aaacaaacaa acaaaaatag aaagcccagg gaccacctgc gtcaggttcc 
2880 



cagccacacc tttttcttgt cctcctctgt ctctggcatc ttctcacagg ttcctaattg 
2940 



tttgtggttg cacaaattca aaatcccaga aaaattacca cttcacaccc actcagatgg 
3000 



ctattttttt tttgaaggaa gataacaagt gttgacaaga acatggagaa attggaattc 
3060 



tcacccattg ctggtgagaa tgtaatacgg tgctgctgct atggaaaaca gcttggagtt 
3120 
tcctcaaaaa gttcaacaga atttcaatgt gacccagcaa ttcccctcta agttatagat
3180 

ctgagaggat taaaaacagt tactaaaata cacggactca catatttcta acagtccaat 
3240 

tcacaagggc caaaaggtgc taatagccca catgtccatc gatggatgga taaataaatt 
3300 

gtggtctatc catacaatgg aatattattc ggccataaat ggaatgaagt actgacgcat 
3360 

gctacagaat ggatgaaccg caaaaaaaat ggatgaacac atgctacaga atggatagcc 
3420 

tcactttact atgaagtgaa ggccagaaac gaagtccata tattgcatca tacaaaatat 
3480 

ccagaagagg gaagcccaca gagacagaat gtgcaatggt ggatgccagg gtctggggag 
3540 

aggggagagt ggggagaaac tgctcaactg gtacaggctt tattttggaa tgatgggaac 
3600 

attttgcaac tagatagagg tagtgattgc agaacacaga atgtactgaa ttccactgat 
3660 

ttttttcacc ttaaaatggt taattttcag tcctgagatt ggataatcat aaaaaaatgg 
3720 

ttaattttat gttatgtgaa tttcatccct atacatattt taaacctcag aaatatacac 
3780 

tagcaggcat ggaacaggtc actgtggtgc ctgccaagcc cggtgatgtt atctggggtc 
3840 

cccggccagc cttaagcctc ttgctgaccg gtggagggca gaacctttgc cctaaaagta 
3900 

taatatccac atgctggcat gattcctggc cagatggctt ctttattagc agtaattgaa 
3960 

actgcctcga tacagacact gtaccttgca accaaaaaat gactcaacaa tgataataag 
4020 

ggttaagctg ggcctttctc tctttgccag ttaaattata tttattatag cttgacatga 
4080 

aaaacaaagc aactccaaca ggtatcacaa gggcaaagga catgaacatt ttatcaaaga 
4140 

agaaatgcag ctgtcaaaaa tacagaaata ttcaaccttg ttcataataa agtggctggg 
4200

ctcagtggtt catgcctgta atcccagtgc tttgcaaggc tgagacagga ggatcatttg 
4260 

aagccagaag ttcaagacca tcctaggcaa gtcagttcaa taccagactt catgtctaca 
4320 

aaacatcaaa aaattagcca ggcatggtga tgcatgcctg ttgtcccagc tactcaggag 
4380 

gctgaggcag gagaattgct tgagcctggg aggctgcggt ggcggtgagc catgattgtg 
4440 

ccattgtact ccagcctggg caatgcagca agactgtcta aataacaaaa ataatagtaa 
4500 

agaaaaggat tgggatgcca tttacttgcg tattcaatac acagagttaa aagtaatttc 
4560 

tacgttttct atttttttat tactaaaaaa agctggacca ttctcacagc ctgaaatgct 
4620 

tctcactttc ccttcttctg tccaaacact tctctatgat aatgcaaaca gtcactcctt 
4680 

taggaagact tcaccccagg tagttccaga tccccttatc tctgccttcc cagaactcct 
4740 

ggtgtctctc cagttccctc cgtgtggtga agtaccctac ctagggtttc agtatggctc 
4800 

tgtctgcaaa ggtcttgttc acaccttccc ttatggttct gttgccctgt gttgtgtcat 
4860 

agcacagggc acagtggaga acccattcac actgatagag agggccccat ggtcctggag 
4920 

ataaccatgt aaccgatcag aataaggcat tgagggctgg gtgtcaggcg tgggctgcac 
4980 

ttgggtgggc aggtcccctg gaaagtcact gggtttggca agcttcctag taacatgtct 
5040 

ctctggggtc ccccttggaa cttcatgcaa aaatgctggt tgctggttta ttctagagag 
5100 

atggttcatt cctttcattt gattatcaaa gaaactcatg tcccaattaa aggtcataaa 
5160 

gcccagtttg taaactgaga tgatctcagc tgaatgaact tgctgaccct ctgctttcct 
5220 
ccagcctctc ggtgcccttg aaatcatgtc ggttcaagca gcctcatgag gcattacaaa
5280 



gtttaattat ttcagtgatt attaaacctt gtcctgtgtt gaccccaggt gaatcacaag 
5340 



ctgaacttct gacaagaaca agctatcata ttcttttcaa ttacagaaaa aagtaagtta 
5400 



attgatagga ttttttttgt ttaaaaaaaa tgttactagt tttgaaaagg taatatgtgc 
5460 



acatggtaaa cactaagaag gtataagagc ataatgcttt tatactacta agaataatgt 
5520 

tttctctaag ttttttttgg tagatgcttt catcagatta agaaaattcc ctgctattag 
5580 

ttgttgaagg tttttatatc ataaatgaaa gttgaatatt attatcatat attattaata 
5640 



tattgttatt gaactatcaa agccttttcc taaaaccatt gagatgatct tataaccatt 
5700 



ctcctttaac ctgttgacga gatcattggt atttatacta tttctctgtt aaccattctt 
5760 



gagtctcagg tttaaattca acttggtcat ggtgtgtcat ctttgatcat tgctgtctgt 
5820 



ggcttgctac tgttttgttt aggatttttg cactgatgct catcaatgag actggcatgc 
5880 



catcttcctt tgcagtcctg atttttttct gatttggatc atgtggttat ggccctcatg 
5940 



gaatgagttg ggcatgatgc ctttttttca tgtctctgga ttgatgggac actttggatt 
6000 



ctctccagat ggccctcaat ggtccctgcc tcctcattgt taggcccctg ggcaagccct 
6060 



tctcatttct ggtaggccca ggaacctgtg ggggttttgt ttgtttgttt gtttcttgag 
6120 



tcggagtctc actctgtcac ccaggctgga gttggagtgc aatggcccga tcttggctca 
6180 



ctgcaacctc cacctcccag attcaagcaa ttctcctgcc tcagcctcct gagtagctgg 
6240 
aattacaggc acccaccgac acaccctgct aatttttgta tttttagtac agatggggtt
6300

tcacaatatt ggccaagctg gtctcgaact cctgatctca tgatctgccc ggcttggcct
6360

cccaaagtgt tgagattaca agcatgagcc accacaccca gtgaacctgt ggtttttaga
6420

agctccccat gcatgtgaat gctgtgagca tcccaggatg acagccactg tgtgttcagc
6480

tgttggaact gtgagaaagc accagtggga ccttctccag cacctgcctg ctgagttcat
6540

ggaagaggct tgttggggag atgatgccct ggctgactcc tgaaggatgg ttaggaatgc
6600

accagatgga agctgggttg gacccactct atgctgaaga acagcttgtg tggacacaag
6660

gagacacgga tatgtcattt ttgtagagcc tgaggagtgt ccaatcacac catttgctta
6720

aaacatcatg cacacttgga aaagtggact gagaccgaat gaagaagcta acagtggcca
6780

gatcagaaag ggtcttgtgt tacttcctag agatacttag attttatcct gtgggtgata
6840

ggagcagttg gagggactga agacaaggaa agaaacatgt ttcaagatct atgtttttca
6900

agacgctttt ctggtggctg agtagggaat tccctggata agtcctgccc agggtcaggc
6960

aaaacaagtt agggggttac tgaaataagg agtatgagaa atggtgtagg ttgtgctgac
7020

gttttgtaac acatctcatg atgatcttca tttccttcac taatttcctg tttcattaat
7080

tcccttccac gtgctcttct gaaatttgcc tcacattctc tgatttctct tttacctgtt
7140

ggtttcatca ccttttactt tttgctttcc tggaaacaca aatgattctg attgtgacat
7200

gtcagaatta tttgcaacat ttgcctttct gctgaaacca tgagttcact gaatacacaa
7260

tttagtaaag tgtaggatgc acatgtcgtt ttcgtggtca caaccagctc tgtagcattt
7320

tataactaca ctggcagtgt gctgggaggt gtagagagaa atatttatca catgtgtggc 
7380 

tgacacaacc tgccaagtta ttttaggagc ctccttggaa tcccagcaag aatgctaccg 
7440 

gcacaatttg taatcacagc atcctgctcc atgccttggc ttcatggcat agtcacttct 
7500 

gcaagtctct ttccagctgt ctgttcccat gtctataaag tatgagttaa atcatcctaa 
7560 

cactactcat cttacaaagt tttcttgctg atgttaagag agttgggaaa gaactgtata 
7620 

aactgtgaag tgccatggag atgttagtgg ttactttatc aagaaataga cactctagaa 
7680 

tggagtagaa agccaacagt tatgattgag tcctcctcct cttcttcttt ttattaattt 
7740 

ataaagaaaa gaggtttaat tgactcacag ttccatatgg ctggggaggc ctcgggaaac 
7800 

tctcagtcat agcaggaggc aaaggggaag aaggcacctt cttcacaagg cggcaggaga 
7860 

gagagagctc ctgttctttt ttgtcataaa gtctacagaa gtgcttatac ttcaggacaa 
7920 

gggcaggcag agagaaggaa ggacattgct tcaccccagc cctcactgac gagtttgcta 
7980 

ggggacctca ctttgtccca gagtagggca gaactctggc cactacccat tcagaaggcc 
8040 

tgggctgcac tgctagttcc tcactaactc tgtgtggcct tgggcaaggt tgggcctgtg 
8100 

ttaacagatt atgaccctgg gctctcaagc tagaggatct aaatttgaat cctggctctg 
8160 

ctaaagcaat tagtgatgta aactttaatg ggtcagttaa ccttcctgtg gcttagtttg 
8220 

ctcatctgta aaatagggat cataacagta tcaataccac atgattgttg gacagattga 
8280 

atcagttaat gcaggggaag tacttagcat gacacgtatt cactatcatt tcctggagta 
8340 




WO 02/36784 PCT/AU01/01407 






agagctgtgt gtgagtgggt gtgagcatgt gtgaaacctt ttctctgcaa tctcagttaa 
8400 

gaaaccaatc cagaatttaa agttcagggc ctaaatgggt ggttatcttc tcccagttcc 
8460 

atcctatccc acctttgctc ttcctcccgc ccacaggagc tgttggtcct tgattgggct 
8520 



ggaagacctg gtggacccta agtgatctat aagaggagaa tagagaacag ggaatgtctt 
8580 



caaaaatcta gagggacaca gaggctgaga ggcaggcagt cctgcagggt cttctgattg 
8640 



ggacaaggag aaccttggtc ttcacaggcc aattctggtc agtttccccc atggacagat 
8700 



gaggaaacag gcccaggaat atccaaggtc tcacacttcc catctgtcaa gtcttgttga 
8760 



ttctgttgta ttcatgtctc tcaaagggag atagagttta gggaagaaag aaggatcaac 
8820 



tgtgtctgat accactggga gcttaagtaa agggttcttt tacttcatag catttatccc 
8880 



aatttgtaat tcagtattat ttgtgtggct gtttggtgtc tctttctcct atatgagtgc 
8940 

tagcttcata agggcaagga ttttgattct ttaatattta gtgcttgcca catgccctga 
9000 



acacagcagg catacaggct aaccaacata cagtggcatg aaagtcatga aagtgagaca 
9060 



cctacctcct ccagtgccaa gagagcataa ccatgcacct gtcactctcc tcaacaccac 
9120 



ccccaagcat gaggcccaaa agcattagct aatcccctcc tccagccact aaaacttaaa 
9180 



ggccaggtgt ggtggctccc atctgaaatc ccagaacttc aggagacagc agcaggagga 
9240 



tcacttgagg ccaggagttt gagatcagcc tgggcaacat agctaggtcc catctgtact 
9300 



aaaaattagc tgggcgttgt tgcatgcctg tagtcccagc tactaaggag gctgaggtgg 
9360 
gaggatcact tgagcccagg aggtggaaac aacagtaagc tataatcaca gcactgaact
9420

ctagcctggg caacagagtg acaccctgcc tcaaaacaat tttaaaaata aataagagca
9480

aaacttagat accacgtggt caccccaaca tgcaaaatca agttttcccc tactgagaag
9540 

aatggggact tgacagctga gttacagaga gataatcttc ttcttctttt ttttttttta 
9600 

gtttacatcc tcaagatcat gacttgtgaa atttgaatcg aatacacatg taattccaga 
9660 

gcaatgttgc ctccgcatac catcagcaat tcacttggct actggaagtc aggat
9715



<210> 3 

<211> 632 

<212> DNA 

<213> Homo sapiens 

<400> 3 

tctagagaga tggttcattc ctttcatttg attatcaaag aaactcatgt cccaattaaa 
60 

ggtcataaag cccagtttgt aaactgagat gatctcagct gaatgaactt gctgaccctc 
120 

tgctttcctc cagcctctcg gtgcccttga aatcatgtcg gttcaagcag cctcatgagg 
180 

cattacaaag tttaattatt tcagtgatta ttaaaccttg tcctgtgttg accccaggtg
240

aatcacaagc tgaacttctg acaagaacaa gctatcatat tcttttcaat tacagaaaaa 
300 

agtaagttaa ttgataggat tttttttgtt taaaaaaaat gttactagtt tttgaaaagg 
360 

taatatgttg cacatggtaa acactaagaa ggtataagag cataatgctt ttatactact 
420 

aagaataatg ttttctctaa gttttttttg gtagatgctt tcatcagatt aagaaaattc 
480 

cctgctatta gttgttgaag gtttttatat cataaatgaa agttgaatat tattatcata 
540 
tattattaat atattgttat tgaactatca aagccttttc ctaaaaccat tgagatgatc
600

ttataaccat tctcctttaa cctgttgacg ag
632



<210> 4

<211> 11186

<212> DNA

<213> Homo sapiens

<400> 4

ggatccagtt tcagctttct acatatggct agccagtttt cccagcacca tttattaaat
60

agggaatcct ttccccattg cttgtttttg tcaggtttgt caaagatcag atggttgtag
120

atgtgtggtg tttgttctga ggcctctgtt ctgttccatt ggtccatatc cctgttttgg
180

tactagtacc atgctctttt ggttactgta gccttgtagt atagtttgaa gtcaggtagc
240

gtgattcctc cagctttgct ctttttgctt aggattgtct tgggaatgtg ggctcttttt
300

tggttccata tgaaatttaa agtagttttt tttccaattc tatgaagaaa gtcattggta
360

acttgatggg gatggcattg aatctataaa ttaccttggg aagtatggcc attttcacga
420

tattgattct tcctatccat gagcatggaa cattcttcca tttgtttgtg tcctctttga
480

ttttgttgag cagtggtttg tagttctcct tgaagaagtc cttcacctcc ctttaatttg
540

gattactaga tattttattc tcttagtaac aattgcaaat gggagttcac tcatgatttg
600

gctctctttc tgttattggt gtataggaat gcttgtgatt tttgcgcatt aattttgtat
660

cctgagactt tgctgaagtt gcttatcagc ttaaaaggat tttgggctga gacgatgggg
720

ttttctaaat atacaatcat ggcatctgca aacaggaaca atttgacttc ctcttttcct
780

aattgaatac cctttatttc tttttcttgc ctgattgccc tggccagaac ttccaatact
840 

atgttgaata agagtcatga gtgagggcat cgttgtcttg tgctggtttc aaagtttttg 
900 



cccattcagt atgattttgg ctgtggtttt gccataaata gctcttatta ttttgagata 
960 



cgttccacca atacctactt tattgagagt ttttagcagg aagggctgtt gaattttgtc 
1020 



gaaggccttt tctacatcta ttgagacaat tatgtggttt tttaatcgtt gattctgttt 
1080 



atgtgatgga ttacatttat taatttgcat atgttgaacc agccttgcat cccagggatg 
1140 



aagcccactt gattgtagtg gataagcttt ttgatgtgct gctggattca gtttgccagt 
1200 



attttattga ggattttggc atcaatgttc atcagggata ttggtctaaa attctctttt 
1260 



tttgttgtgt ctctgccagg ctttggtatc aggatgatgc aggcctcaga aactgagtta 
1320 



gggaggattc cctcattttc tattgattgg aatagtttca gaaagaatgg taccagctac 
1380 



tctttgtacc tctggtagaa ttcagctgtg aatccatctg gtcctggact ttttggttgg 
1440 



taggctatta attattgcct caattttagg gcctgttatt ggtctattca gacattcaac 
1500 



ttcttcccgg tttggtcttg ggagggttta tgtgtccagg aatttatcca tttcttctag 
1560 



attttctagt ttatttgtgt agaggtgttt atagtattgt ctgatggtag tttgtatttc 
1620 



tgtgagatcg gtggtgatat cccctttatc attttttatt gcatctattt aattcttctc 
1680 



tcttttcttc tttattattc tggctggcgg tctgtcaatt tttttgatct tttcaaaaaa 
1740 



ccagctcctg ggtttcactg attatttgaa gggttttttg tgtctctatt tctttcagtt 
1800 



ctcctgtgat cttagttatt tcttgccttc tgctagcttt tgaatgtgtt tgctcttcct 
1860



tctctagttc tttgaattgt 
1920 

tgtggtcatt tagtgctata 
1980 

tctggtatgt tgtgtctttg 
2040 

tttgttattt gcccagtagt 
2100 



gttttgagtg agtttcttaa 
2160 

gtttgttgtg atttccattc 
2220 

tcaattttag aataagtgtg 
2280 

tgtggagttc tgtagatgtc 
2340 

ttttatcaaa gaagaaacac 
2400 

aaagtggctg gcctctggtt 
2460 



ggatcacttg aggccacaag 
2520 



catgtctaca aaacatcaaa 
2580 



tactcaggag gctgaggcag 
2640 



catgattgtg ccattgcact 
2700 



taataataat agtaatagta 
2760 

ttacttattc aatacacaaa 
2820 

aaaaaagctg aaccattctc 
2880 




gatgttacag tgttgatttt 

aatttccctc tacacattgg 

ttctcattca tttcaagaac 

cattcaggag caggttgttc 

tcctgagttc taatttgatt 

ttttacattt actgagcatg 

atgtgatgct gagaagaatg 

tattcagtcc actgggtgca 

agctatcaaa aatccagaaa 

cattcctgta atctcagtcc 

ttcaagacca tcctagacaa 

aaattagcca ggcatggtga 

gaggattgct tgagcctggg 

ccagcctggg caatgcatca 

ataataataa taataaagaa 

gttaaaagca atttctactt 

acagcctgaa atgcttctca 
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agatctttcc tgctttctct 
tttacatgtg tctcagagat 
atctttactt ctgccttcat 
agtcttcatg tagttgtgtg 
gcactgttgt ctgagagaca 
ctttatgtcc cattatgtgg 
tatattctgt tgatttgggg 
gagctgagtg gacatgaaca 
tattgaacct tgttaataat 
tttgaaaggc tgagaaagga 
gtcagttcaa gaccagactt 
tgcatgcctg tcatcccagc 
agattgaagt ggcagtgagc 
agactctgtc taaacaataa 
aacggttggg acgccattcc 
tctctatttt tttattacta 
ccttcccctc ttctatacaa 
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acacttctct gttgatgata 
2940 



agttccagat ccccttatct 
3000 

gtgtggtgaa gtacccccac 
3060 

acattttccc ttatggttct 
3120 

acccattcac accgatagag 
3180 

aatagggcat tgagggctgg 
3240 



gaaagtcaca ggatttggca 
3300 

ttcatgcccg atgctggatg 
3360 



gaaactcatg tcccaactaa 
3420 



tgaatgaact tgctgaccct 
3480 



agttcaagca gccccatgag 
3540 

gccctatgct gaccccaggt 
3600 

ttcttttcaa ttacagaaaa 
3660 

tttggaaagg taatttgtgc 
3720 



actgctgaga ataatgtttt 
3780 

aattcactgc tgttagttgt 
3840 

atgtattatt aatatattat 
3900 



atgcagacag tctctccttt 

ctgccttccc agagctcctg 

cttgggtctc agcatgactc 

gttcccctgt gttgtgtcac 

agggccccat ggttctggag 

gtgtcaggca tgggctgcac 

agcttcctag taacatctct 

ctggtttatt ctcgagagat 

agttcataaa ctccagtttg 

ctgctttccc ccagcctctc 

gcattacaat gtttagttat 

gaatcacaag ctggacttct 

agtaagttaa ctgataggat 

acatggtaaa caagaaggta 

ctctaatttt ttttggtaga 

tgaaggtttt ttatatcatg 

tattgaacta gcaaaggctc 
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aggaatactt cacaccaggt 
gtgtctcccc agttccctct 
gttctttgaa ggtcttgttc 
agcactgggc agagtggaca 
ataaccatgt aactgatcag 
ttgggtgggc aggccccctg 
ccctggggtc ctcttggaac 
ggttcattcc aataatcaat 
taaactgaga taatctcagc 
agtgcccttg aaatcatgtc 
ttcagtgttt attaaacctt 
gacaaggaca agctatgata 
tttttaaaga tgttactagt 
taagaggata atgcttttat 
tgctttcatc agattaataa 
aatgggagtt gaatattatc 
ttcctaaaac aattgagatg 
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atcttataat cgttctcctt 
3960 

tgttaactat tcttgagtct 
4020 

acactcctgt ctctggcttg 
4080 



tgagactggc atgtcatctt 
4140 



ccctcattga atgagttggg 
4200 



ctttggagtc tctccagatg 
4260 



gcaacccttt ctcatttctg 
4320 



tttgtttgtt tgttttttga 
4380 



caatggcctg atctcggccc 
4440 



ctcagccttc tgtgtagctg 
4500 



atttttagta gagacggggt 
4560 



ctcaaattcc tgacctcatg 
4620 



tttgtgcctc cacacacagt 
4680 



tgtgagcatc ccgggatgac 
4740 



cagcgggacc ctctccagca 
4800 



gatgccctgg ttgactcctg 
4860 



cccagtctat gctaaagaac 
4920 



gcagagcctg gggagtagcc 



taatctgttg atgagatcat 

caggtttaaa ttcaacttgg 

ctactattgt gttcagcatt 

cctttgcggt cctgattttt 

tgtgatgcct tctttttcat 

gccctcaatg gtccctgcct 

gtaggcccag gaacctgtgg 

gttggagtcc tgctttgtct 

actgcaacct ccacctcctg 

ggattacagg catccaccac 

tttacaatat aggccattgt 

atctgcctgc ctcagcctcc 

gaatctgtgg tttttaaaag 

agccactgtg tgtccagctg 

tttgcttgct gtggtcatga 

aaggatggtt aggaatgcac 

agcttgtgtg gacacaagga 

aatcgcacca tttgcttaaa 
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tggtatttat actttttctc 
tcatggtgta tcatctttga 
tttgcactga tgccgatgaa 
ttcagatttg gatcatgtgg 
gtatctggat tgatgggaca 
cctcattgtt aggctcctag 
gttttatgtt tgtttgtttg 
cccaggctgg ggttggagtg 
ggttcaagtg attctcctgc 
cactcctggc taatttttgt 
gatctcttgg acaggctagt 
caaagtgctg agattacagt 
ctcctcatgc atgtgaattc 
ttaaaactgt gagaaagcac 
aagaggcttg ttggggagat 
cagatggaag ctgggttgga 
gacacgaaca catcattttt 
acaccgtgta cagttggaga 
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4980 



agtggactga gacaggctga 
5040 



cttcctagat atacttagat 
5100 



ccaaggaagc atgctttaag 
5160 



gaattccctg gataagtact 
5220 



aggagtatga gaaatggtgt 
5280 



tcatttcctt caccaatttc 
5340 

gcctcatatt ctttgatttc 
5400 



tcctggaaac acaaatgatt 
5460 

tctgctgaaa catgagctca 
5520 



ttttcatggt cataaccagc 
5580 



gtgtagagag aaatatttat 
5640 



gccttcttgg aatcccagca 
5700 



ccatgccctg gcttcatggc 
5760 



catctataaa gtatgagtta 
5820 



gatattagga gagttgggaa 
5880 



gttactttat caagaaatag 
5940 

gtcctcctcc tcttcttttt 
6000 




agaagctaac agtggccaga 

tttatcctgt gagtgatagg 

attccatgtt ttttgagatg 

gcccagggta ggcaaaagaa 

aggttttgct gatgttttgt 

ctgtttcatt aattcccttc 

tcttttacat gttggtttca 

ctgattgtga catgtcagaa 

ctgaatacac aatttagtaa 

tctgtagcat tttataacta 

ctcatgtgtg gctgacacaa 

agaacaccac tgatgcaatt 

ttagtcacgt ctgaagtcta 

aatcatccta atactactca 

agaactgtat aaattatgaa 

acactccaga atagagtaga 

ttttaattta caaagaaagg 

tgagaaaggg tcttgtgtta 
aacagttgca gggactgaag 
ctgtctggtg gctgagtagg 
gctaggaggt tactgaaata 
aacacatctc atgacaatct 
cacgtgctct tctgaaattt 
tcacctttta ctttttgctt 
ttatttgcaa cattcccctt 
agtgtaggat gcacatgttg 
cactggcagt gtgctgggag 
cctgccaagt tgttttagga 
tgaaatcaca atgtcctgct 
tttctaacta tctgtttcca 
tcttacaaag ttttcttgct 
gtgccatgga gatgttggtg 
aagaaaacag ttatgattaa 
tttaattgag tcacagttcc 



atatggttgg ggaggctcag aaaacttgca atcatggcag ttggcaaagt ggaagaaggc 
6060 



accttcttca caaggtggca ggagagagag agctcctctt cttttttgtt gtaaagtcta 
6120 



cagaagtgca tatacttcag ggcaagggca ggcagggaga agaaaggaca ttgcttcacc 
6180 



ccagtcctca ctgacaagtt tgctttggga cttcattttg tcccagcata tgggacagag 
6240 



ctctggccac tacccattca gaaggcctga gctgcattgc tagttcccca ctaactctgt 
6300 



gtgtccttgg gcaaggctgg gcttatgtca aaagattatg accctgggct ctccagctac 
6360 



agaatctaca tatgaatcct ggctctgcta gagcaattag tgacgtaacc ttggatgggt 
6420 

cagttaacct tcctgtggct tagtttgctc atctgtaaaa tagggatcat aacaacatca 
6480 



ataccatggg ttgttagaca gattgaatca gttaatgcag ggtaaatact tagcatgaca 
6540 



cgtattcact atcatttcct tgagtaaaag ctgagtgtga gtgggtgtga gaatgtgtga 
6600 



aaccctttca ctgcaatctc agttaagaaa cccatccata atttaaagtt cagggcctaa 
6660 



atgggtggtt atcttctccc agttgcatcc tatcccacct ttgctcttct cctgcccgta 
6720 



ggagctgttg gtctttgatt gggctggaag acctggtgga ccctaagtga tctataagag 
6780 



aatgagaata gaggacaggg aatgtcttca aaactcctag agggacacag aggctgagag 
6840 



gcaggcagtc ctgcaggggt cttctgattg ggacaaggag gaccttggtc ttcataggcc 
6900 



aattctggtc aatttccccc atggacagat gaggaaacag atccaggaat atccaaggtc 
6960 



tcacacttcc catctgtcaa gtcttgttga ttctgttgta ttcatgtctt tcaaagagag 
7020 
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agagagttta aggaaagaaa 
7080 

aagggttctt ttacttcata 
7140 

tgtttagtat ctctttgtcc 
7200 

tttaatattt agtgcttgcc 
7260 



acagtggcat cgaagtgaga 
7320 

ctgtcactct cctcaacacc 
7380 

ctccagccac caaaacttaa 
7440 

taggagacag cagcaggagg 
7500 

tagctagaca ccatctgtac 
7560 



ctactaagga ggctgaggta 
7620 

ctataaccac agcactgaac 
7680 

tttcaaaaat aaataaataa 
7740 

aatcaagttt tcccctactg 
7800 

tctgcctttt tttttttttt 
7860 

atctaataca caaatcattc 
7920 

ggccactgga agtcagaaca 
7980 



aaaggatgaa aatatcggag 
8040 

tccactcatg ctgatatctg



gaaggatcaa ctgtgtctga 

gcatttttcc caatttgtaa 

tattagagag atagcttcat 

acatgccctg aacacagcag 

cacctacctc ctccagtgcc 

acccctaagc atgaggcccg 

aggccaggtg tggtggctcc 

atcacttgag gccaggaatt 

taaaaattag ctgggcatgg 

ggaggatcac ttgaacccag 

tccagcctga gcaacagagt 

aaacaaaact tagataccac 

agaagaatgg ggacttgaga 

tttggtttac atcctcaaga 

cagagcaatg ttgcttctgc 

agcttcccag aagagaggta 

tgaaggtgtt ccttgcatca 

agcctaatgc ttctctgaat 
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tatcactggg agcttaagta 
ttcagtatta tttttgtcac 
caggacaagg attttgattc 
gcatacagac taaccaacat 
tagagtacat gtccatggac 
aaagcattgc taatcccctc 
tatctgaaat ctcagaactt 
tgagacgagc ctgggcaaca 
tggtatacct gtagtaccag 
gaggtggaag ctacagtgag 
gagaccctgc ctcaaaacaa 
gtggtcaccc caacatgcaa 
gctgagttac agagagataa 
tcatgacctg tgaaatttga 
ctaccacgag taattcactt 
ccacttggac taccaatata 
ctgagtccct ggacagcctg 
gttgagattt aactttgatc 
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8100 

caatgaaacc agaccaagaa 
8160 

ctcacaattt tatgattatt 
8220 

tctccatgtg cttagggtac 
8280 

actgacagca cagctttaaa 
8340 



aggcaagtgg tgcttgggtg 
8400 

agctctcagt aggcaagtcc 
8460 

cagaacatgc cttcaattat 
8520 



gagtgctggc tgcatttgaa 
8580 

caaccagtca caccacctcc 
8640 

aatcatctta gccaagtaag 
8700 

agttttgttt ctaaaaaaat 
8760 

atagttatca gatgacactg 
8820 



tgtggcaaca tagagaggga 
8880 

ctttctttct ccctgctgag 
8940 

ggagtaaccc ttgagacagg 
9000 

ttactatttg ccatagtgat 
9060 

cctaatatct cccttcctgt 
9120 




agaagaaacg tctttcattg 

tttccttagc tgtcctataa 

aaagttgacc aagaccaaga 

aacatgatga atgctttaat 

ttcttccaat gcatagtatc 

ctacatgtta gaagatgtta 

tttcctttgc agaacaacac 

ttccaagcaa cgattagtct 

tgaagtttgt tgggcaggta 

tgttttagag gaaagaagaa 

aagaaaggct ctttgtttag 

gaatctttac ttgccagaat 

gatcctccag caatgccatt 

atggggtcct ttgttctaga 

aacacgaatc tcaaccaatt 

gcttagcaag gaatggcagg 

taacttttta taatatttta 
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ttgataagga catgattttt 
ttatctgctt atttgtctct 
ataatgtctg ggagcacaat 
acaggaaatg agtaggggag 
ttccttgaca cagtcagtgc 
ctttctgtgg aattaggtgg 
caatttcatt agttaggaca 
atcactgttg gtatagattc 
aatcttcatc ttagaataaa 
aacataatcg tttccataag 
gtgagctaat gaagttgttg 
gtgttctgtg cacctctcgg 
gatatggtca gaaactgcat 
aaacccaggg ggtgccactg 
tctggttgca gccttgagtc 
tgcaccagag cagcagagga 
ttgtgatcag tatcagttgg 



gaagctactt gcagtcactg agcctcagtt tctacatctg taaactgggg atagtagcat 
9180 



ggccctattt aatgtgctca gcgaagccac tgaaaggaga cagaaatgta ccagaattcc 
9240 



ctggactttt atcctacttc tcctggggat tgtcacccac ctacccgtgt ctgtcctttg 
9300 



ttgctttgac gctgtcactt cttttcttag gtacctctct gtagggctcc attattccag 
9360 



ggattccaga gttacagcac atgcatacct ccatccaagc atgtttattt gtctcctgct 
9420 



tcactaggct gtccccaagg aacatgtggc tcccggcaca tacctggcac aacactgcac 
9480 



atgacattca cccacttggc cttgaatctg acaaggaatc tggcatgatg ttcacctgct 
9540 



gaggccaggt gccgagcagc cctggaggct taggggccag agggatggga aaaggtgtct 
9600 



ttctggggtg agtatcagtt tctgcaggag tgctgaacct gagaaagaat aaagagagaa 
9660 



ggaagtgaac aagcacagct taaacatcat ctgtttctac tgagttttaa caactctgag 
9720 



attttgtttg tcatggaatc catttctcag gccaagcaga cacagaactt gggtgtgagt 
9780 



gatgataatg agctgatata attttcacac cctcatcact gagatctctc ccatcaggaa 
9840 



tgggtcacag ggctcacagg tggcagcaac tgttattaca ggcctcatct ctaccagctc 
9900 



ctggcacctg ctctcctctc attagaaaat cctccacttg tcaaaaagga agccatttgc 
9960 



tttgaattcc aattccaccc tcaagaggct gggaccacct cattggagtc cttgatgctg 
10020 



tgtgacctgc agtgaccact gccccattgt tgctggctga ggtggtttgg gtcaacctgg 
10080 



ccatctgggc agctgttctc ttctcttctt tctcccctac tgtttccaga catgcagtat 
10140 
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ttccagagag aaggggccac 
10200 



gacttttgaa agctacagag 
10260 



tccatctcat ccatgccatg 
10320 



gacaaatctg aaagtgcaga 
10380 



atgacctaag aagatggagt 
10440 



gcaatctcag ccaagtcaac 
10500 



tgatagggac cagagccatg 
10560 



tctgctaagg gaaacaggcc 
10620 



gaactgcatg aaggggatgg 
10680 



ggttggtgag gaaagactct 
10740



ggagtcttct agggggcaca 
10800 



tgttgggatg agtttcaagt 
10860 



aatagatttt atgccaatgg 
10920 



tccccagtaa cattgattga 
10980 



tcagtgagtg gtgtgtgtgt 
11040 



ctgcaggcag agcacggggg 
11100 

agcatataaa caatccaaca 
11160 



cacacacagc ccagcaaaca 



tctttggcaa agaacctgtc 

gaagaagcac aaattgatgc 

tctctttttt aggggtcctc 

gacagcagct gaggcacagc 

ggtcaccaga aagtcagagg 

tccaccagcc tttctggtcc 

agagtgagta agaccagact 

tggaaacaca caatggtggt 

aaagtgccca ggggaggaaa 

aagagaaggc tctgtctggc 

ggcacactcc aggcataggt 

attctggaat gaggacagcc 

ctccacttga gtttgtgata 

gttgtgtatg attctacata 

gattatttgc caactgccga 

ccctgctact ggctgcagct 

gcctcactga atcactgctg 

gcagca 
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taactttcta tctacggcag 
tattccacta agccatcagc 
ttgccaacag aatcacagag 
caagagctct ggctgtatta 
aagtgacaca caggggccca 
ccactgtgtg tacagcaccc 
atgcccttga ggagctcacc 
aaagaggaaa gaagacaata 
tggttacttc tgtgtgaggg 
tgggtatgaa aggatgtgta 
aaagatctgt aggcatggct 
atagagacaa gaggagagtt 
agaacccaga acccttggac 
gaatattaac tcaatggagg 
ggtggagaag cctcttccga 
ccagccctgc ctccttctcc 
tgcagggcag gaaagctcca 
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11186 



<210> 5 

<211> 10200 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gccaattaga agaaacacaa 
60 

acagtcaaat atttggcaga 
120 



caacaaaaag atatgaatca 
180 

aagttctgct agtgtttcta 
240 



tatagtttta ggtattttac 
300 



aatagcaatt ccttcacatt 
360 

tcagttggcc agctttggta 
420 

aaaatgtact ctaggaccta 
480 

agctggtcta gctggtcaag 
540 



gtgaagtaaa atagctagag 
600 



tagccaatct aaaatcctca 
660 

agctcagtgt cagaaaacct 
720 

tttggtcaag ttctttggca 
780 

cctctacatt ctcttgcaag 
840 

aaattaaggt tattttaaag 




ctacaaggtc agggcatatt 

attacaaaat atctcattgg 

gaattcatct gtgtctcaag 

cactaccgtt agcctcatta 

atacattttt attattaatt 

ttttttttca aatcttgcac 

tctgatactt ggactacaga 

cagcccttct gctagatcct 

agcaaaaata aaatcaagat 

aatatgattg cgcctgtccc 

ctgttagaat cctcctgtca 

gtgttgttaa cttggccctc 

tttcagagac tcagagtagt 

ctcaaacatc tatgaatcca 

aaataacctt aaaattatta 
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attcaaacag tagagacaat 
aaaagacacg caagggaaat 
aaaaggtcat gcgataaatt 
ccttattttt taagtgttaa 
acaaccaaag tgcaacttgt 
cttaaaatcc acctcgggcc 
taccactaag gcaagtagat 
gaagaatgat cattaaaaca 
gacagaaaat tgatgcaaaa 
cttagcatgg attcccatgc 
atatgataga atgaacagca 
tttctagctg aatgtgtgtt 
gaaggaagtg gataagatga 
gagagaaaaa ctagagcatg 
gtattcgagg atctccaata 






900 

tattcatggc accactcaaa 
960 

tccttaatga ggtctgccct 
1020 

cttccactct ctttcccctg 
1080 

aaaatcactt atttactgtg 
1140 

ctacctaggc agggattttt 
1200 



atatgtgaca tatacagggt 
1260 



acaaatttaa aataaaatat 
1320 



tcattggtct gctagcagtg 
1380 

gtgaaataat actaagtgtt 
1440 

cttgaccttg ctcctgggag 
1500 



cataaaacga taaaaaaaac 
1560 

agccatttcg ggtaatattg 
1620 

ggcaaagggt ttggttataa 
1680 

caaccaaata tgtgttgaat 
1740 

tgctaagcac aggagaaaat 
1800 

aacttctttc aagcaaatca 
1860 

acattcggaa tatagacatc 
1920 
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actttccttc tgctctatcc 

gactatccta cttaaaattg 

ctctgttttt caccgtaata 

ttgttatcta tctgcctact 

gtatgttttg ctcatggata 

acttgattaa tactgttgag 

ttcctaactt aaaattgtaa 

tttcttgtat atggaaatat 

ctaaagaaat aagtgagtga 

agagttcatt tgagattaaa 

taaaagggaa atggtgtttt 

ttatttttat gtcaggaatt 

attaagagaa tgaggaaata 

gtcaaaggaa tttccctgag 

ttggcttatt actttatagc 

cttactagtc tattaacaat 

caggtactaa gctgattgct 
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cgtcttggct caaagttatc 
taaactttgc ccacctggta 
ctttactctt tttaacatac 
cttaccatca aatataagtt 
tatacgaagc acttagagta 
tgaatgaatg agtttccaat
agtcagatct aaccaactgt 
attttaaata gatatgtcct 
acgttacctc attgaactaa 
caagttcaaa gtctatgaat 
tataagctct gcaattcaaa 
cctcagtgct gatatcttag 
ggtacatagt aggattgttc 
gaataatctt cagaataatt 
cagatttcat ttttaattga 
aacaacataa acacaagtaa 
ttacactcac tgtcttattt 
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tacaagtaag gagttttagt 
1980 



aacttaaacc caatctgttt 
2040 



cttgtttcta aaacttaaat 
2100 



aaaatcatat ttttgtagac 
2160 

ggcaggcatc attgaaattg 
2220 

ggtcacttta gatatataag 
2280 



gccttatatg ctaaaattat 
2340 



ttgaggaact ttacacggct 
2400 



tagtgatcaa taaaagtttt 
2460 



aaaaaattag ccaggcaagg 
2520 



tggggagatc ctttgaatgc 
2580 



gcttcaacct tggtgacaga 
2640 



aaaaaaaaga atcaaagaaa 
2700 

ataagctata aagtgtcaaa 
2760 



ggaattgagc acatagtatg 
2820 

tctaccctca cagaacttaa 
2880 

ctctcttctg tattatcttc 
2940 
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tgcagcaaaa gaaataaatt 

ggtgctaaag ccaatgttct 

ttatcagtaa aaggcaaaat 

tctgaggaca aatccaacaa 

gtatatagct tccttgggta 

ttcagtctgt ttgtaaaaac 

cctaatcgtt ttcaccttta 

catcttcata ttgtcagcat 

agccagcctg ggcaacatag 

tggcgcacac ctttggtccc 

aggaggttga ggctgcagtg 

gcaagaccct gtctcacaca 

aaaataattc cccagcttaa 

taatgctgtt aatggacatt 

tgctgtattt tatatacaga 

attcttcagg agaatgacac 

ctcaagtgga ggtatatggt 
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ttccaatgtc aaatgaccag 
ttactgcaat gttgggttat 
ttgctattat tgaggacatt 
aaaagttcca actatttctt 
ttgactttga aaaggaagtt 
aaaatgaaaa caaaacagtt 
acaacatata cacacagaac 
ctagcaaagt acttgccaca 
tgacagccta tctctacaaa 
agctacttga gaggaggatg 
agctgtgatt gcgccactgt 
cacacaaatt aagtaaagaa 
gtccatcttt atttgtttgg 
tctctagctc tcccaaagga 
ataaaaatag agacaagatt 
tgaagtcctt aattggactt 
gcttagttat gaaaaatacc 
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tccagggctt tgatcttctc 
3000 

aaaaccatgt atccaactta 
3060 



agcctcccat gtattttctt 
3120 



tcttcctctc tcacttttct 
3180 



ttaataaatg aaaaatactg 
3240 



ccttcattca atgtgagtgt 
3300 



ataaccagga ttctgaaaat 
3360 



ataaaataat acacttttaa 
3420 



tcagacctaa ctttcatcac 
3480 

taatattaca ccaattaatt 
3540 

tgccttttgt ccttgcatat 
3600 

cacgttttat attactactt 
3660 

tgtggttaag tattcaggtc 
3720 



atacctttgg ctgatatttt 
3780 



aaattgaatt actgagtcaa 
3840 

tgaataattt tcaactccta 
3900 

tttcaaaata tctttgctgt 
3960 



tatgttttct tctgtgtacc 
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aataactctt tgaggctgat 

tatagacagt tgatgaccaa 

attacttaag gagaatctct 

tagaaacatg ggtaagattt 

ggaattctca atgtttcttg 

cccttaataa ggagcaaact 

gcattctcat ccccatctcc 

tataggaaat ttctcaaata 

taaaagataa tcactcttga 

tgcttgatat agtgaatatt 

tagcataggt attttctgag 

ttttaaagag gatgtataat 

actgctcatt tttcactgtt 

tttctgtact tggaattatt 

acagacttaa gttttcctta 

cttgtttatt taactcttgt 

tgtaatttgc atttctttgt 

agtatactac atactttttt 
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atgaaaacag taattagaaa 
agctagaatc cagttatttc 
atctctacct ctttctctct 
tcagaaatat gagaaactta 
ttttagccag ttaattttgg 
ccactgagag atagatacta 
aaacttttat aaaaaatatt 
cagaaaaaat taaagtaaac 
cactttgata tctttatctc 
atgctattat aattttcccc 
gttatcacaa actctgtaag 
aattaattca tccatatata 
ataaaataaa gcagcaatga 
tctttaagat agatttccta 
tgtatgtttc cttattcatt 
gagcatgtga tagtctcatt 
agttagcatg aatatttcag 
atatgaattg cctatttgca 
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4020 



tcttttgctc ggttctatta 
4080 

atatattaag aatattaacc 
4140 

tggttttaaa tgtgaattta 
4200 

ttaaaagtaa taaaattatt 
4260 



ttccagtcat cacttttact 
4320 



ccatctttat taggatattt 
4380 

accttatttt cttcttacaa 
4440 

cacagaaatc aaggtttttg 
4500 



ttttgagcag tggagtattt 
4560 



ccaagagttg ccacttcatt 
4620 



cttaattttg tcattcattt 
4680 



agggggcagt gacttactca 
4740 



ttgatagagg aggctgggcg 
4800 



aggtgggtgg atcacttgag 
4860 



ccatttctac taaaattaca 
4920 



ctactcagga ggctgaggca 
4980 



cctggattat taacactaca 
5040 
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gacctttgaa ttttttctta 

tattgtgata tttgcaataa 

ttcaattttc ttcataattt 

gccatattat tatttatttt 

ctcatgttct taatttttat 

gggaaataat acaatttata 

aaaccctagg aatatgctgt 

agcagtggag tatttcttca 

cttcaaagcc acacagctag 

ttcttctttc cctttatctt 

aaaaacattt cttgtgctat 

gagagatgat tctctaatgg 

tggtggctca tgcctataat 

gccagaagtt caagatcagc 

aaaattagcc tggtgtgctg 

tgagaattgc tggaacccag 

ctccagcctg ggagacagag 
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tccatttata taagctcttt 
atagctatat ggtttgttgt 
tgttgttttt atagatttct 
caatgtccat tactagccct 
tcatatcttg gctccattga 
ctaaacacac atcaaatctt 
ttttgtcttt atttgaatga 
aatgacacag aaatcaagtc 
taagtcatga agctggaatt 
actcagttgt cttctctcct 
tatggtagat ttattttaat 
agttttaaag atcttagaag 
cccagcactt tgggaggctg 
ctagtcaatg tggtgaaacc 
gtgcactccc ataatcccag 
gaggcggagg ttgcagtgcg 
taatactcca tctcaataaa 
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aagaagttga taagggagat agttcatggc aacggatctt tgaaggcacg ctaatgataa 
5100 

cttaggcatt tagcctacta gtgtaatttc cataaatctg cctctgatgt catactctca 
5160 

gcacctaata ttttctacaa acatttattg aaactttatt ttgtataagt ctctgtccaa
5220

tttgaatatt taaaaaattc ataatcatat gaaacattaa taataaatac aaaatgagag
5280 

atgccgatac tgaaaagtag gattgcggag tggtagaaaa tatttctggc tgtagtagat 
5340 

ggggaagtgt tcaaagagga gtataattca ggtttccatt tgccatcgac ttatcacatg
5400 

gctaactcac taagcgactt aattaaaatt aaattaattt atcatcatct gatcaccatt 
5460 

tcacacaact catgtctgtt gctgtattgg ctaaatgatg gcaagacaaa cgacctctga 
5520 

aaatgatcct attgaccttc ggaatctgga tttttttttc aatgcaggtg tccatagaag 
5580 

caatctgatg taatccaaca tgagttcaag cacagtcatt taatatcccc tatcaagtac
5640

agtcatttaa tatcccctat catcacatgt ccttcataca taaaaatcat tacatgtgaa
5700

agggtggaga gtgtgtggat cccttattat tgtgttattg taacacaata acaatattgt
5760 

gttattattg ttaacacaag tgagtcatat gtcttgctct ttggactgag tggaaacttg 
5820 

tattctttct ctgcctcagg tcaattaact tcattgaggt gagttgcatt ccttctttaa 
5880 

gcatgttgaa ccttcaatct ggactcagat gggctaaata gaggagctag gaaaaataca 
5940 

gaaaataaat tattagagag atcagagaaa gatacataag atttacaata aaagaattat
6000 

gagaaaaaca tccaaaagaa ttaaaaacca taggagaagg aaaaataggt gaacagcttt 
6060 
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ttaatttcta taaatgtgtt gttaatactc ataataaagg actcagagct gggatatgag 
6120 

aataataggt caaacgtata tggatacata gatgtgacta catacatgag ttgcaaagaa 
6180 

tgctaaggag ggcaaaaaga gattgagaag agggcattat tactaatata tagcaatgtt 
6240 

gaatgtttag ggtgtttcag gcactgtaca aatcttttaa atacacaaat cacttaatct 
6300

tgccataaca ttagaagata ctatctactc tttaccaaaa aggtaactgt gggatagcag 
6360 

agttaagtaa cccgttcaaa cctatgcata ataatcagca gagatggtcc ttatctaagt 
6420 

ttttctgcct ttgaagtcca aatagtttaa tgcagccagg tactaaagaa gaaaactttt 
6480 

gtaaattagt ttagtttaat gatttacatg tggaaaagca cagagtgaaa agcacatatc 
6540 

atctgagaag cccagtgagt ttggctgaaa tggagtgaac atgtacatgt tgagggtgag 
6600 

gaagatgatt agagagaagt gatgtgttgt ggttcttaaa agctaagagg aaactgttag 
6660 

atatgatata gtctgtggca gggagccatt gcaggttttc caagatggac tcataaggag 
6720 

aaaacacttc gtgggactgg aaaaggcagt gaagtggtgt gtcctatgat aatgtactac 
6780 

agtgtaggat agtggttaag agtacagtta tgagagaggg actactggtt agcaccttac 
6840 

ctgctgtgtg actgggcaaa taatgcaaac ctcagtgtct tttattgtaa tatgggagta 
6900 

acaaaaatag taactacttc ataggattct tgtaaagatt aaatgactta atttctttga 
6960 

agtgcttggc agttcctgat aaatgaccag tagttaataa atgttagttg ttattattat 
7020 

cattatatat tattactccc atagatacat atagaacaga ctgcagcaga gaggcaaatc 
7080 

tttaatgttg tcagagtata gacaagttgg tgaaatggct acatgagagc ggaggacaag
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7140 

aa 7200° a9a ttgtg9cagt cacttcaaat ggaaatatca ccgcttgaat gaaggtatat 

gagtgtcaac ttgcaagggg accaggtagg tttcatcaga aattaaggaa gcttaaggag
7260 

aacagccaag ttcagcttga cagaagtggt ggtggcacaa atgcaagact ggtgtctttc 
7320 

aagaaaccaa ggactgttga aagtagcaag agctagtttg ttttaggtcc atcatatttt 
7380 

atattcacac tttcatgtca gtggagcaaa gaaatggaat acaatataat agaatggtaa
7440

aatcttattt ttaaaatctg tgttattctg atctttaact tacttatatc tttgatagag
7500

at ^ttacct gatgctcaag attgtagaaa tagtataatc aacataacag tatagcactg 

tatttatatc ctgcactgtt tagggagggt ttaaggccat tcaaaaggat acataaaata 
7620 

caacaagatt acataaatga aaggtgagat aaagcaacaa agcaaaacaa aaataaaaac 
7680 

agagatcata ggcacaaata agattaaaaa cgcatgtaat gaagatgaaa gcttttacat 
7740 

ttaccccaga tggaccacag ggttgttgtt aagcctttaa acagtgaaca atgctgtaca 
7800 

cttgcatatg caattagaac atgtggaaaa aatagtggcc tgttagaagc ctaattaaca 
7860 

atttgtgaaa aaaaaaaaaa aaaaaaaaaa aagaggccga gctgtagctc acgcctataa 
7920

tccctgcact ttgggaggcc gaggcgggcg gatcacgagg tcaggagatc aagaccatcc 
7980

t 9|ctaacac agtgaaaccc agtctctacg aaaaatacaa aaaattagcc gggcgtggtg 

9C 8100 9CCt gtagtcccag ctacct ggga ggctgaggca ggagaatggt gtgaacccgg 

gaggcggagc ttgcagtgag ccgagatcct gccactgcac tccagcctgg gcgacaaagc 
8160 
aagactccgt ctcaaaaaag aaaaaagaaa gaaaaacaaa agaaaacttc attgtattgt
8220 

aaggccaaga acaaaatata tcaagataag gaaaatttgt agtcaagaat agaaaaaaat 
8280 

tatggctttg aagtatgagt tatttaaaga aagtggaaac atcctcagac tatgcagtaa 
8340 

aaaacaaagt gattttcttc ttctaaactt atgcaataaa ctgataggta atatgtgaaa
8400

gtcatagaat gtagactaga ggatacaaca aacctatttc ctctatgttc ataagaagta
8460

agaaaagctc tgatgtgagt tagcattgct ttacaatttt gaattgtgca gattgcacgt
8520

acttttcctc agtttgaagt aaatagtgga caggaaaaaa tattaaatgt tggcagtaaa
8580 

tatggaagga aattacaact aatgtaatat gctaaaacat gctatgttta ttttactaat 
8640 

ttgaattaaa atgtaagaat ttaaaatgcc ctggaaaaac acgggcattg atctgacgtc
8700

tgaagtttta aaatattaca cactttgaaa tagcatttgt accttgaaat acctgtctct
8760

atatattttt taaaacttcc tttttctttc attccattta tcatcaaata aaggatgaac
8820

agatgtaact cagaaactgt caagcatgct gaagaaagac cactgcagaa aaatttctcc 
8880 

tagccttttc aaaggtgtta ggaagcagaa aggtgataca gaattggaga ggtcggagtt 
8940 

tttgtattaa ctgtattaaa tgcgaatccc gagaaaattt cccttaacta cgtcctgtag
9000

tt 9060 9gat at9aagactt at 9 fc 9raactt tgaaagacgt gtctacataa gttgaaatgt 
cc ccaatgat tcagctgatg cgcgtttctc tacttgccct ttctagagag gtgcaacgga 

agccagaaca ttcctcctgg aaattcaacc tgtttcgcag tttctcgagg aatcagcatt 
9180 
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cagtcaatcc gggccgggag 
9240 



cgccggggcg tgggctgagc 
9300 



attcgagtag cggctcttcc 
9360 

tctttccact aaagtcggag 
9420 



ggagcgcgag gtaggggcac 
9480 

ggctttcaga tttcctgaac 
9540 

tgtgaagtcc tctggcaagt 
9600 

cctggagcag cgcccaaacc
9660 



cgcgcggtgc ggggacctgc 
9720 



cgcggcgact ggaaccggga 
9780 



tgccagaccc ccaactctgc 
9840 



ctgcagtgcc catcgcggcc 
9900 



gctctctggc ttcgacgggg 
9960 



tgcatttggc taatgagctg 
10020 



caatggagga gcaaagaaga 
10080 

tttcattttc atagtttaca 
10140 

ctgaaataaa tgacatgttg 
10200 
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cagtcatctg tggtgaggct 

acagccgctt cgctctcttt 

aagctcaaag aagcagaggc 

tatcttcttc caaaatttca 

gcaaagctgg gagctactat 

ttggtcttca cgggagaagg 

ccatggggac caagtggggt 

gtagtggcac tggaccatgt 

tctctgagcc cgcgggcggt 

gggagaatcg cactggcggc 

cttcgtggag atgctggaga 

agagcagctg gggcatcaac 

gactagaggt tagtctcacc 

cggtttctct tcaggtcgga 

agaacttttt taaactgaac 

tagttgcgag atttgagtaa 

ttgtttttaa ttatttttaa 
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gattggctgg gcaggaacag 
gccacaggaa gcctgagctc 
cgctgttcgt ttcctttagg 
cgtcttggtg gccgttccaa 
gggacagttc ccaagtgtca 
gcttcttgag gcgtggatag 
tagatctaga ctcaggagct 
tgcccggagc gcgcacagcc 
gggtgggagg aagcatcgtc 
gggcaaagtc cagaacgcgc 
ccccgcgcac aggaaagccc 
ggcgggcgct ccctcttact 
tccagcgcgc ctgaggctca 
atggatcttg aaggggaccg 
aataaaaggt aactagcttg 
tttatttcta gcctccagct 
gaaacgcaag ctagcctttg 
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